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ABSTRACT 

We present a comprehensive treatment of the pixel-lensing theory and apply it to lensing experiments and their 
results toward M3 1 . Using distribution functions for the distances, velocities, masses, and luminosities of stars, 
we derive lensing event rates as a function of the event observables. In contrast to the microlensing regime, in the 
pixel-lensing regime (crowded or unresolved sources) the observables are the maximum excess flux of the source 
above a background and the full width at half-maximum (FWHM) time of the event. To calculate lensing event 
distribution functions depending on these observables for the specific case of M31, we use data from the literature 
to construct a model of M31, reproducing consistently photometry, kinematics and stellar population. We predict 
the halo- and self-lensing event rates for bulge and disk stars in M3 1 and treat events with and without finite source 
signatures separately. We use the M3 1 photon noise profile and obtain the event rates as a function of position, 
field of view, and S/N threshold at maximum magnification. We calculate the expected rates for WeCAPP and for a 
potential Advanced Camera for Surveys (ACS) lensing campaign. The detection of two events with a peak signal- 
to-noise ratio larger than 10 and a timescale larger than 1 day in the WeCAPP 2000/2001 data is in good agreement 
with our theoretical calculations. We investigate the luminosity function of lensed stars for noise characteristics of 
^■f^ , WeCAPP and ACS. For the pixel-lensing regime, we derive the probability distribution for the lens masses in M31 

^ ■ as a function of the FWHM timescale, flux excess and color, including the errors of these observables. 

CO \ Subject headings: dark matter — galaxies: halos — galaxies: individual (M31) — gravitational lensing — Local 

CN . Group 

o . 

; 1. INTRODUCTION 

^^Searches for compact dark matter toward the Large and Small Magellanic Clouds (LMC and SMC) and the Galactic bulge identified 
^_j^numerous microlensing events in the past decade (MACHO, Alcock et al. 1997; EROS, Aubourg et al. 1993; OGLE, Udalski et al. 
O^OOO; DUO, Alard & Guibert 1997). In parallel to these observations, a lot of effort has been spent on the prediction of the number, the 
I spatial distribution, the amplitude, and the duration of lensing events toward these targets. The underlying models require knowledge 
Oof density and velocity distribution, as well as of the luminosity and mass function of lensing and lensed stars. The halo MACHO mass 
^—(fraction and lens mass are free parameters. From that, the contributions of self-lensing and halo-lensing is obtained. The self-lensing 
^predictions (minimum lensing that has to occur due to star-star lensing) serve as a sanity check for observations and models. An excess 
. .of lensing relative to self-lensing can then be attributed to halo lensing, from which the MACHO parameters are finally inferred. 
>Paczynski (1986) was the first to present such a lensing model for the Galaxy halo and to estimate the probability of lensing (i.e., a 
^magnification larger than 1.34) taking place at any time. This probability is also called the microlensing optical depth. On the basis 
^of this work Griest (1991) evaluated the optical depth with more realistic assumptions on halo density and velocity structure. He also 
j^obtained the event rate and distributions for lensing timescales and amplifications. Alcock et al. (1995) related the Einstein timescale 
■ "distribution of the events to the microlensing rate and optical depth. They evaluated these distributions for several axisymmetric 
disk-halo models in the framework of the MACHO project. 

Any microlensing light curve can be characterized by the maximum magnification, the time to cross the Einstein radius (Einstein time) 
and the time of the event. The first two observables depend on the line-of-sight distance of the source and lens, the minimum projected 
transverse lens-source distance (impact parameter), transverse lens-source velocity, and lens mass. These quantities therefore cannot 
be extracted separately from an individual lensing event; instead, one can only derive probability distributions for them (see de Rujula 
et al. 1991 and Dominik 1998). Most interesting are of course the object masses responsible for the measured lensing light curves: 
Jetzer & Masso (1994) have derived the lens mass probability function for an event with given Einstein time and amplification. Han 
& Gould (1996b) have determined the MACHO mass spectrum from 51 MACHO candidates using their observed Einstein times. 
Blending has proven to be a severe limitation in the analysis of microlensing events. It can be overcome partly by using low-noise, 
high spatial resolution Hubble Space Telescope (HST) images for measurements of the unlensed source fluxes (see Alcock et al. 
2001a). For extragalactic objects, however, this can provide a precise source flux for a fraction of lensed stars only. 
One can also use an advanced technique called difference imaging analysis, which is insensitive to crowding and allows to measure 
pixel flux differences in highly crowded fields at the Poisson noise level. Therefore, lensing searches could be extended to more 
distant tai-gets like M31 (AGAPE, Ansari et al. 1999; Columbia- VATT, Crotts & Tomaney 1996; WeCAPP, Riffeser et al. 2001; 2003; 
POINT-AGAPE, Paulin-Henriksson et al. 2003, Calchi Novati et al. 2005; MEGA, de Jong et al. 2004; SLOTT-AGAPE, Bozza et al. 
2000, Calchi Novati et al. 2003; NMS, Joshi et al. 2001), or M87 (Baltz et al. 2004). 

Gould (1996b) called microlensing of unresolved sources "pixel-lensing". This definition encompasses surveys at the crowding limit 
as well as extragalactic microlensing experiments (e.g., toward M31 or M87) where hundreds of stars contribute to the flux within 
1 pixel. Gould (and also Ansari et al. (1997)) showed that the comparison of pixel fluxes at different epochs can extend the search 
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for microlensing events up to distances of a few megaparsecs. Applying his equations Gould (1996b), Han (1996) and Han & Gould 
(1996a) obtained the optical depth and distributions of timescales and event rates for a pixel-lensing survey toward M31. If one does 
not know the flux of the unlensed source accurately (i.e., if one is not in the classical microlensing regime anymore), the information 
that can be extracted from light curves is reduced. 

Wozniak & Paczynski (1997) were the first to note that the light curve maximum does not provide the maximum magnification of the 
source anymore, and, second, one cannot obtain the Einstein time from the FWHM time of an event (since the latter is a product of 
the Einstein time and a function of the magnification at maximum). This initiated efforts to deal with the lacking knowledge of the 
Einstein timescales in the pixel-lensing regime (see Gondolo (1999); Alard (2001)) and the suggestion to extract the Einstein time 
using the width of the "tails" of the lensing light curves by Baltz & Silk (2000) and Gould (1996b). 

However, it is more straightforward to compare quantities that one can easily measure in an experiment with model predictions for the 
same quantity. The two independent and most precisely measurable observables are the flux excess of the light curve at its maximum 
and its FWHM timescale. Baltz & Silk (2000) followed that strategy and derived the event rate as a function of the FWHM timescale 
of the events. We proceed in that direction and calculate the contributions to the event rate as a function of the event's FWHM time 
and maximum excess flux, because both the excess flux and timescale determine the event's detectability. 

The definition of Gould for pixel-lensing may imply that a pixel-lensing event should be called a microlensing event, if its source has 
been resolved (e.g., with HST images) after the event has been identified from ground. Analogously, one could feel forced to call 
a microlensing event a pixel-lensing event, once it has turned out that "the source star" is a blend of several stars, and therefore the 
source flux is not known. Therefore, the classification of an event as a pixel-lensing event or a microlensing event is not unique. 
One can take the following viewpoint: the physical processes are the same, and therefore classical microlensing is a special case 
of pixel-lensing, in which the source flux probabiUty distribution is much more narrow than the stellar luminosity function, i.e., the 
distribution function used in the pixel-lensing regime. The two methods only differ in how to analyze a light curve and how to derive 
the probability distribution for the source flux: One can make use of a noisy and potentially biased baseline value of the light curve 
(hence, stay in the classical microlensing regime), or ignore the baseline value and obtain a source flux estimate from the wings of the 
light curve (analyze the difference Ught curve). Other possibilities are to obtain the source flux from an additional, direct measurement 
or to constrain its distribution by theory. After having determined the source flux probabiUty distribution by one of these methods, 
one can use the formalism described in this paper to derive, e.g., the lens mass probability function. 

Our paper is organized as follows: We introduce our notation for the microlensing and pixel-lensing regime in § 2. We also describe 
the treatment of finite source effects and how to extract the observables from the light curves. In § 3 we combine the probability 
distributions for location, mass, source-lens velocity and impact parameter distribution to obtain the lensing event rate distribution as 
a function of these parameters. Section 4 summarizes the statistical properties of the source populations, i.e., luminosity function, 
number density, color-magnitude and luminosity-radius relation. In § 5 we calculate the optical depth and the observables in the 
microlensing regime: single-star event rate, amplification distribution of the events, Einstein timescale distribution, and FWHM 
distribution of the events. Section 6 deals with the pixel-lensing regime. We calculate the event rate as a function of the maximum 
excess flux and FWHM time (and color) of the event in the point-source approximation. We also show how the event rate changes, if 
source sizes (shifting events to larger timescales and smaUer flux excesses) are taken into account. In § 7 we obtain the event rate for 
pixel-lensing surveys with spatially varying photon noise (related to the surface brightness contours of M31) but fixed signal-to-noise 
threshold for the excess flux at maximum magnification. We predict the number of halo- and self-lensing events in the WeCAPP 
survey (without taking into account the sampling efficiency of the survey) for the M31 model presented in § B. We demonstrate 
that accounting for the minimum FWHM of the events is extremely important to correctly predict the number of events and the 
luminosity distribution of the lensed sources. We also compare the characteristics of self-lensing events with halo-lensing events. 
Finally § 8 derives the lens mass probability distribution from the observables and errors as obtained from light curve fits. The paper 
is summarized in § 9. In Appendix A we motivate an alternative event definition. In Appendix B we describe and construct ingredients 
of the M3 1 lens model, which we use throughout the paper to calculate examples and applications. 



In this section we summarize the basics of microlensing theory and introduce our notation. The change in flux Af (f) caused by a 
microlensing event depends on the unlensed flux Fq and the magnification A(;): 



For a pointlike deflector and a pointlike source moving with constant relative transversal velocity Vt, the amplification is symmetric 
around its time of maximum to and is connected to the Einstein radius and the impact parameter b as follows (Paczynski 1986): 



2. BASICS OF LENSING BY A POINT MASS 



AKO :=Fo[A(0-l]. 



(1) 



A{u{t)) = 



+ 2 M<i 1 



(2) 



m\/m^-h4 " 




(3) 




(4) 



where M is the mass of the lens, D^i and Dqs are the distances to the lens, and r(t) is the distance between source and lens in the lens 
plane. 
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With the Einstein timescale^ Ie '■= ^ and the normalized impact parameter uq := 4- we obtain 



The maximum amplification (at t = to) becomes 



m(0 = 



Ao := 



+ 2 H0<1 1 



-\/mo^+4 



Mo 



Equation (2) can be inverted to 



u(A) = 



2A(A2-1)-'/2_2 



Mo 

1/2 A>1 1 
~ A' 



Inserting Ao in equation (7) its derivative can be written as 
duo _ 2 [(Aq^- 1)-V^- 1/2Ao(Ao'- ir^/^ZAo] 



<iAo 



2[2Ao(Ao2-l)-V2-2; 



1/2 



,(Ao'-l)V2 



(Ao'-iy 



-1/2 



-x^[Ao + (Ao^-iy/^] 
2 (Ao^- 1)5/4 



1/2 



The FWHM timescale ^fwhm of a hght curve is defined by A ( '-^^ ) - 1 := ^ • K is related to the Einstein timescale tE by 

^FWHM = tEW(Uo) = ?eT(Ao), 

where w(mo) was first obtained by Gondolo (1999)^: 

w(Mo) :=2^m(^)'^=2^ 

and T(Ao) := w(m(Ao)) is 



2[A(«o)+l] 
V[A(Mo)-l][A(«o)+3] 



1 2 ""S^ /To' 

-2 — Mo ~ v12mo, 



T(A„) =2\/m 



-m(Ao)' = \/8 



[(Ao+n'/^-AoCAo+S)'/^] " Ao>l 



[(Ao-l)(Ao+l)(Ao+3)li.« 



(5) 



(6) 



(7) 



(8) 



(9) 



(10) 



(11) 



Hence, the easy measurable timescale fpwHM is a product of the quantity Ie, which contains the physical information about the lens, 
and the magnification of the source at maximum hght Ao. 

2.1. Finite Source Effects 

If the impact parameter of a source-lens system becomes comparable to the source radius projected on the lens plane i?* the point- 
source approximation is not valid anymore. The amphfication then saturates at a level below the maximum magnification in equation 
(6). 

The finite source light curve for extended sources can be derived for a disk-hke homogeneously radiating source, 



A*(M) = 7 /a((«2+|,-2„^cos^)'^')-^ 



2-K 1/z 







(12) 



jA(u{\+q^-2qcosef'^')qdqde 

{ \-rci~-2q co^ 0)-r2i u~ 





, 27r l/z 



qdqdO 



^(1+9^-2? cos ef-Hi/u\Uq^-2q cos 61) 

with source-lens separation r{t) and where the definitions 

z(t) := m(0 = 

and q := - have been inserted. For high magnifications, where A(m) w m"' is a valid approximation, equation (12) becomes equivalent 
to Gould"(1994b, eq. (2.5)). 

The maximum amplification in the finite source regime then becomes 



^0 = I Air/RE)-ff^. 

""V^'D^) 



. 2 %o. 



,2a„*»1 



2 Re Dps 
R,Doi 



(13) 



^ Griest (1991) defines the event duration fe as the time span where the lens is closer than a relative impact parameter uj to the source. This can he converted to the 

Einstein timescale using tg = 2(e ^/ uj^ — Uq^, cos e = (1 - j(oV«T^) '''^ Baltz & Silk (2000) use a different definition for the Einstein timescale ; for comparison use 
?E = 2fE in their formulas. 

3 With P = uo and 5(/3) = Aq- 1. 
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Fig. 1 . — Amplification A(z) versus z{t) := [r(()/i?»] (Oos/foi) plotted for R* /£>os = 0.5 (Se/£'oi) ■ Black curve: point-source approximation, see equation (2). Blue 
curve: finite source magnification A*(z) for a homogeneously radiating disk of size R*, exact solution, see equation (12). Red dashed curve: simple approximation 
A*(z) for finite source effects according to equation (15). Blue dots: finite source size approximation in the high-magnification regime, introduced by (Gould 1994b, 
eq. (2.5)). 

which equals the approximation of Baltz & Silk (2000, eq. (19)) for high amplifications. 

For small source-lens distances with Dqi w Don (e.g., for bulge-bulge self-lensing) the above relation becomes 



1-1-1.5 X 10^ 



M 



£>os-£)oi 
Ikpc 



For a source radius of supergiants of R^, sa 2QQRq a source-lens distance of 1 kpc, and a lens with M = IMq finite source effects 
aheady arise above a magnification of Aq « 6.2. For smaller masses M = O.IM0 finite source effects become important even at a low 
magnification Aq w 2. Although typical source radii are smaller, this example shows that finite-source effects cannot be neglected. We 
will show in § 6.3 and Table 7.3 that indeed a large fraction of the M31 bulge-bulge lensing events will show finite source effects. 
Figure 1 shows that for u <Uq (or z~ i) with 



«o '■= "('^o) = S 2 



1-1- 



R* Dqi 
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1/2 



1/2 



Ao*»i R^D, 



ol 



2ReDo 



(14) 



the amplification is no longer directly connected to the source-lens separation (Gould 1995), but all u < Mq have nearly the same 
amplification equal to the point-source approximation A(u) at Mq . Therefore we generalize equation (2) to approximately account for 
finite-source effects 



A*(u) 




u <u, 







u> u, 



0- 



For light curves with finite source signatures (mq < Mq) at an impact parameter 



(15) 



M (1 + (AS - l)/2) ' « ' (R, Doi)/(Re Dos) 

(or z w 1) the amplification of our approximation is half of the maximum and can be used to define the f^HM- 
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2 ~ 2tE 
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2fE J { %d'^^ - Mo^ K. fpwHM 7; Y ( 1,*^^°' ) ~ 1 > fpWHM Mo < Mq 
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T*(mo,/;*,Doi,Dos,M):=2Wm' "^"^^ 



-M(r 



2(A* + 1) 
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(16) 
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In equation (16) the FWHM timescales for light curves that show finite source signatures are related to the values fpwHM for the point- 
source approximation using equations (9) and (10). This demonstrates that the source does affect the timescale of an event severely: 
a source with an impact parameter of one-tenth the projected source radius will have an event timescale almost 6 times as long as that 
in the point-source approximation. 

The shortest and longest FWHM timescales for an event with finite source signature (mq < Mq) equal (insert mq = Uq and mq = into 
eq. [16]), 

'FWHM.min - 'E ^ \-^()) ~ V J „, 



vi Do, ' 



»FWHM,max ' 



(17) 



2 g-floi 



For a given transversal velocity the minimum timescale becomes the larger, the larger the source sizes are. 
The largest flux excess of a lensed, extended star becomes 



Af,„,ax = Fo(A*-l) = Fo 



^ ^ 16GMDos(£)os-£)oi) ^ 



irrespective of whether the light curve shows finite source signatures. 



(18) 



2.2. Extracting Observables from Light Curves 
2.2.1. Measuring Af and tFwroA 

In this section we present three methods for measuring the excess flux at maximum and the FWHM time fpwHM- One can see in 
equations (9) and (16) that tu and mq (or Aq) enter the value of ?fwhm as a product, giving rise to the "Einstein time magnification" 
degeneracy, which may lead to poor error estimates for tu (and uq) even for well-determined values of ?fwhm and Ap. 
Accounting for this degeneracy, Gould (1996b)* approximated the Paczynski light curve with one fewer parameter for the special case 
of high amplification: 



' ^eff 



(t-tof 



+ 1 



-1/2 



(19) 



The three free parameters are Fgff := ^, feff := uqIe, and to- This approximation has turned out to be a very useful filter for detecting 
lensing events; however, it fails to describe light curves when the magnification is not very large. We suggest using 



Af{t) « Af 



fFWHM^ 



+ 1 



-1/2 



(20) 



instead. This approximation provides a good description also for lower magnifications. The three free parameters of this approxima- 
tion are the time of maximum to, the excess flux , and the FWHM timescale ?fwhm- 

Figure 2 shows that equation (20) better approximates the Paczynski light curve than the Gould approximation in the core and in the 
inner part of the wings, and also provides the correct value for fpwHivi and Af. 

There are two situations that can require a fourth, additive, free parameter in the light curve fit. The first one is the transition regime 
from pixel-lensing to microlensing (i.e., where the errors are small enough to sample the wings of the light curve). We suggest using 



Apit) « Feff 



(t-to? 



-1-1 



-1/2 



-Fo, 



(21) 



which provides an excellent fit to the Paczynski light curve (see Fig. 2, green curve). 

The second situation is the following: imagine that the photon noise of the background becoming larger and finally exceeding the 
unlensed flux of the star Fq. Then the star cannot be resolved anymore and the rms error of the baseline of the light curve becomes 



proportional A^^ 



■1/2 



data points 



a. The (minimum) systematic error is given by the fact that the subtracted reference image (with error a^ef) 

is a sum of (high-quality) images, potentially including some of the amplified phases of the sources.^ This implies that there are 
fundamental limits to the accuracy of the baseline, and we thus require an additive parameter to account for that. The approximation 
of any pixel-lensing light curve then becomes 



Apit) « Feff 



(t-tof 

ttS^ 



+ 1 



-1/2 



+C. 



(22) 



In fact, numerical simulations showed that much more accurate values are derived for F^s and t^fs if this additional constant C is 
allowed for. 



Gould's (1996b) eq. (2.4) with P = uq, u> = Ie'^ . 
' The measured light curve varies around the theoretical light curve L(t) due to noise of amplitude a. We can therefore write the light curve measured at times t as 

L(0 + (T := [AF(0+B + (T]-[AF(?ref)+B + (Tret] = Af(0- AF(?ref)-CTret + 0- = ApiO + Cr + C', 

with the background B, the epoch for the reference measurement tj^t, and a constant C := -Ap(tjet)-crret- 
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Fig. 2. — Different light curve approximations using tlie following parameters: fpwHM = 2, Aq = 20, Fq = 1, «o = 0.05005, and (e = 12.28. Black curve: Paczynski 
(eqs. [1] and [2]). Red curx'e: Gould (eq. [19]). Blue dashed cun-e: Eq. [20]. Green dashed curve: Gould fit with additional free constant (eq. [21]). Gray line: Af /2 
marks the flux level where fpwHM is defined for the Paczynski curve. 

2.2.2. Constraining Fo 

In this section we address the important question, how to extract the source flux Fq from a tensing light curve. There are four potential 
ways to constrain the flux of the tensed star: 

1 . The tensed star is resolved and isolated, and therefore a bias in the flux measurement (by crowding) can be excluded (assuming 
no systematic effects in the baseline). One would of course call such an event a classical microlensing event. A microlensing 
fit (using analysis methods) to the light curve then directly provides Fq and its probability distribution, ideally given by an 
Gaussian error (Tf„. In this case the flux measurement error is directly correlated to Q, the signal-to-noise ratio at maximum 
magnification.^ 

2. The flux F() is obtained through the information that is in the shape of the wings of the difference light curve Fo{A{t)- 1) + C = 
FoA(t)+B. The analysis leads to a probability distribution for Fq. This flux estimate method is used if no alternative unbiased 
flux measurement is available, i.e., cases in which the source star is resolved but blended (see (Alard 1999) for applications in 
the microlensing regime), and cases in which the source star is not resolved (usually called a pixel-lensing event). Note that 
other methods using the shape of the wings (Baltz 8c Silk 2000) provide similar results. 

3. The flux Fq is obtained from an additional, direct measurement, e.g., low-noise, high spatial resolution photometry from space. 

4. The flux Fi) is constrained by theory through plausible distribution functions, e.g., the luminosity function the color- 
magnitude relation of stars, and the distance distribution of stars, which together yield the source flux distribution function 
(see § 8.3). Another constraining example is an upper source flux limit that can be obtained from the fact that the source star is 
not resolved in the absence of lensing. 

Since the physical processes are the same in pixel-lensing and microlensing, microlensing is a special case of pixel-lensing, where the 
source flux probability distribution is much more narrow than the stellar luminosity function, i.e., the distribution function used in the 
pixel-lensing regime. The methods only differ in how to analyze a light curve and how to derive the probability distribution for the 
source flux. 

2.2.3. Evaluating tE 

In this section we use the distribution of Fq (from measurement or theory; see previous section) to estimate the probability distribution 
for a value of Ie- Note that transforming the distribution of Fq to a distribution of Ie can lead to a different value compared to a Ie 
obtained directly from the best estimate for Fq. 

As the fitting process in the light curve analysis yields the non degenerate observables fpwHM and A^, we can combine their (Gaussian) 
measurement errors with the probability distribution for the source flux Fq and obtain the probablity distribution for Ie'- 

PtM) =111 Pl,mm(tFWHM)PAA'^F)PFo(Fo))6{tE-'-^)dtEWHMdAFdFo 

= 111 P,^UMitFWHM)pAr(AF)PF,(Fo) ^<'^^"-'^^' dtEWHMdApdFo (23) 

= / / /'rFw„„(fET)pA,(Af)/,f„(Fo)T(^ + l) dApdFo. 
This also allows to include non-Gaussian distributions for the source flux. 

By transforming the measurements of A/? and fpwHM together with a probability distribution of Fq, we derive a general formalism that 
is applicable to all microlensing and pixel-lensing problems. In § 8 we further develop this idea using plausible distribution functions 
as physical constraints, which narrows the width of the distribution of the lens mass M (connected to Ie). 

' If the noise is dominated by background sky, one can write Uf^ = AfQ^^N^,^^^ ^l^i^, where A'daia points is the number of the light curve data points. 
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3. DISTRIBUTION FUNCTION FOR LENS PARAMETERS 

For a source of fixed intrinsic flux Fq, position 5 = ix,y,Dos) and velocity vector Vs = iVs^,Vsj,Vs,z), the number and characteristics of 
lensing events are determined by the probability function p(ri,vi,M) for a lens with mass M and velocity v/ being at position r*/. For 
the change of magnification of the background source, only the transversal velocity components of source and lens are relevant (we 
assume velocities to be constant). For parallax microlensing events (Gould 1994a,b) the nonuniform velocity of the observer changes 
the observed fight curves, since the observer's reference frame is not fixed. However, this effect is unimportant for extragalactic 
microlensing events. 

Therefore, in addition to M and Do] only the projected relative transversal positions r := r^j - §^rt. v and velocities Vt := Vt./ - and 
the angle (p enclosed by relative position and velocity vector enter the lensing properties. The distributions in r and (p can be reduced to 
the distribution of one parameter, the impact parameter b of the lens-source trajectory. This is obvious, since in a symmetric potential 
the trajectory of a particle is fuUy described by its miiumum distance. 

So, the relevant lens parameters are Doi, n, M, and b. We introduce the lens density and the distributions of Dqi, vt, and M in the next 
two subsections and then come up with a new lensing event definition in § 3.3. For those lenses that satisfy the event definition, i.e., 
those which cause events, we will then derive the distribution of the impact parameters dN/db. We will show that our event definition 
gives the familiar relation for the event rate but is more easy to implement in numerical simulations. 

3.1. Distance and Mass Distribution 
The probabiUty distributions for a lens with mass M being at distance Doi are given by 

PD,,=piDoi) [ piDoOdDoi , (24) 



PM = m 



j i{M)dM 

.0 



(25) 



where p{Do\) is the lens mass density and ^(M) is the lens mass function (which itself is normalized to J ^{M)MdM = 1; see Binney 
& Tremaine (1987, p. 747)). The number density per lens mass interval finally is defined by 



n(DouM):= p(£)oi)C(M), 

where n(Doi,M) has units of length"^ mass"' . 

3.2. Velocity Distribution for Lenses 
We assume that the velocity distribution of the lenses around their mean streaming velocity is Gaussian: 

1 



(26) 



p(vi,i)=Qe , i=x,y,z, Q 



277(71 



where cf\ is the dispersion and depends on the position {x,y,z). We furthermore assume that the combined transverse motion of 
observer and source relative to the mean transverse streaming velocity of the lenses is known and occurs in the x-direction with 
amphtude vo{x,y,z) as projected onto the lens plane. This means that the velocity v., of the source turns into a projected velocity 
Vp = Doi/Dos Vs (lensing timescales are determined by relative proper motions not absolute motions of lens and source). 
We now define the relative projected velocity v;,^ := v;^+ vq (analogously visj := v;^^ +0) and obtain the transverse lens-source velocity 
distribution as^ 

Pv,(vt,vo) = J J s(vt-y^vis/ + visA e\p (- '■""^'^"^ ] exp(-^) dvu^ydvu^ 

-CX)-CX) ^ ' ^ / \ / 

- ^ ? 7 <-'-^-v'^)^< ""-" V^) exp f-^^ dv, dv, 

= I exp (-^j -^==Ldv,s, 

= ^nexp(-^)/o(^). 
Here the Bessel function /q stretches the distribution depending on vq. 

' We extract the desired distribution functions using 



(27) 



= J J p{x,y)5(s- s(x, y)) dxdy ■■ 



ds{x,y) 
dx 



yi 

-I- 



dxdy = J p(x(s,y),y) 

yo 



Note tliat if ds(x,y)/ dx\ ^^^^ ^.^ has a different domain for y tlian f(x,y), the limits for y have to cliange to yo and yi 



ds{x,y) 




dx 


x=x^s,y) 



dy 
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Fig. 3 . — For a projected lens-source separation r and an angle <j> between the projected distance vector and the projected relative velocity vector the impact parameter 
of the source-lens configuration '\%b = rsin(</)). The lens approaches the source only for angles between -7r/2 and 7r/2. 



3.3. Impact Parameter Distribution for Events 

In Paczynski (1986) definition for lensing events (liereafter called standard definition) lens-source configurations become lensing 
events if the magnification of a source rises above a given threshold within the survey time interval At. This means that for each lens 
mass one can define a "microlensing-tube" along the Une-of-sight to the source, which separates the high-magnification region from 
the low-magnification region, and a lens causes an event if it enters the tube. 

We use (for the motivation, see § A) an alternative event definition: a lens-source configuration becomes an event, if the lensing 
hght curve reaches the maximum within the survey time Af. This definition does not specify any specific magnification threshold 
at the time of maximum magnification because this magnification threshold will in reahty depend on the observational setup and the 
brightness of the source. We show that the impact parameter distribution for the maximum hght curve event definition agrees with the 
standard definition, if the same magnification threshold is used. 

For simplicity we consider lenses with one mass, distance, and velocity, for the moment only. The lenses are homogeneously dis- 
tributed points (in two dimensions) with density n and velocities of Vt (the velocities can have arbitrary directions, but the angular 
distribution of the velocities must be the same for all the points). The number of lenses per radius interval around the line-of-sight to 
the source is 

^(r) = nli^r. (28) 
dr 

If r is the source-lens distance at the beginning of the survey and (f) is the angle that the lens's velocity vector encloses with the 
lens-source vector at that time, then the configuration will become an event with impact parameter b if b < r < \/b^ + (vtAt)^ and 
b = r\ sin(0)| with ^ e [-7r/2,7r/2] holds (see Figure 3 and § A). Therefore, ^ can be derived from the spatial distribution of the lenses 
relative to the source, and the distribution of the angles between velocity vector and distance to the source. For the special case in 
which all lenses have isotropic velocities of Vt, the probability for the angle between radius vector and velocity vector is independent 
of the location of the lens and equals 

P<t>i(t>)'-= 0<(j><2Tr, 

ZTT 

and 

i/CviAO^+fe^ 7r/2 ^(viAty-+h^- tt/2 . 

mb) = J 2 J f^6ib-rsm(j>)dHr = n J 2 J r^^^^^dcj^dr 
= n f -^=dr = 2nvAt. 

b 

In this equation, the radial integration limits correspond to the minimum and maximum source-lens separation for an event with impact 
parameter b within Af , and the (5-function then allows only for those trajectories through r that have the correct angle for the impact 
parameter b = rsm((f)) of interest. The factor of 2 accounts for integrating from to 7r/2 instead of -7r/2 to n/2 in the angle. In the 
second line of this equation we have changed the variable in the (5-function from rsin(0) to </> and then have carried out the angle 
integration and finally the r-integration. The quantity ^ has units of length"^ 

Note that ^ is independent of b; i.e., the impact parameters of the events are uniformly distributed. Of course, in reahty, an upper limit 

bmax will be present, depending on the source brightness, background light and the observing conditions. The integral jj^'"'"' ^db = 
2n Vt Af X /jjnax is dimensionless and equals (for the considered line-of-sight) the number of lenses that cause an event above a minimum 
magnification (corresponding to /^max) within A/. 

Equation (30) can also be obtained from geometrical arguments: a circle with radius b embedded into a two dimensional plane defines 
a cross section of 2b to streaming particles in that plane, independent of the streaming direction. Therefore, the number of particles 

passing through that aperture with diameter 2b in a time Af is n\\At2b. Hence, equation (30) also holds for a coherent particle stream, 
with any velocity direction. Therefore equation (30) is also vaUd for any probability distribution of the velocity angles. 



(29) 



(30) 
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The number of events per line-of-sight distance Dqi, lens mass M, transversal velocity vt, and impact parameter b follows from equation 
(30) by replacing n with n{Doi,M, Vt) = n(£»oi,M)pv,(vt,£'oi): 

=2niDoi,M)p,XvuD,i)vAt = 2p(Doi)aM)PvXyuD,i)vtAt. (31) 



dDoidMdvtdb 

We now transfer the number of the events per Une-of-sight to the event rate (per line-of-sight), T := and write equation (3 1) as 

With the relative impact parameter uq defined as uq = r^^^^^ m) ^^^^ distribution can be rewritten as 

doJudMu. = 2p(OoO^(M)Pv.(v.Z)oO v./?H(Z)„.,M), (33) 
which corresponds to the event rate for the standard definition (see de Rujula et al. (1991)).^ 

4. THE SOURCE DISTRIBUTIONS 

In the case of pixel-lensing the parameters of the source cannot be determined. Therefore, we now introduce probability distributions 
for the source distance Dos, velocity v^, unlensed flux Fq, color C, and radius /?* (for finite source effects). 

4.1. The Transverse Lens-source Velocity Distribution 

We again assume that the velocity distributions of lenses I and sources s are approximately isotropic around their mean respective 
streaming velocities (cf. equation (27)). The projected velocity dispersion of the source population we call = Do\/Dq^ CTj. We define 
vq = (vo,;c,vo,y) as the difference between the projected streaming velocities of the source and lens populations. Then, the transverse 
velocity differences in x and y between a lens and a source, each drawn from their respective distributions, are: v;,^ := v;^- v,^ + vq^ 

and vis,y :=vi^y-Vs,y + vo,y. 

Similar to equation (27), we obtain for the distribution of the transverse velocities 



Pv,(vt,vo) = j 5[vi-.^vu/ + vis/jp{vu^)p{vu,y)dvu^dvu^y, (34) 
where p{vis^) [and p{visj) analogously] is given by 

+00 "l"00 / ^2 2 2 2 \ 

Pivis^) =CiCs J J exp f- '^^"'^^^;?/'-' ]S(vis,x-{vi^-Vs^+vo^))dvi^xdVs^ 

-OO -oo ^ 

+00 +00 / ~ 2 2 2 2 \ 

= CiCs J J exp [- '^'"'2^|2'^2"" j S(vi^-(vis^+Vs^-vo^))dvi^dVs^ 

= QQ J^exp (-1^) yi^^v., (35) 



In the last step we have defined 



<^is:=\jai^+{^) C7,\ (36) 



which is the combined width of the velocity distribution of the lenses and that of the sources, projected onto the lens plane. 
Finally, analogously to equation (27), we obtain 

1 / Vt^ + vl\ ( VoVt 

— vtexp - Mo — 



PvXvt, vo) =—Vt exp -— — — ^0 — 2 ) ' (37) 



with vo(x,y,Doi,£>os), ai(x,y,Doi), and as(x,y,Dos). 

4.2. The Luminosity Function 
The luminosity function (LF) (fi (flux"') or $ (mag"') is usually defined as the number of stars per luminosity bin""'. 

** de Rujula et al.'s (1991) eq. (10) with dV = d'^T, D = A,s, x = Doi/Das, r^l^ixd -x)]'/^ oc Re, poH(x) = p(Doi), dno/d^i = g(M), = uq yields d'*r ■- 
2Z)os V, pv, (V, ) Re p{D„t ) ((M) dM duQ dvi {dD„t /Do,). 

Note that we neglect the correct indizes refering to the bandX and define Fq = Fox, Fvega = ^Vegax, ^ = ^x,^q = x, A/t = Af^, = Aix,C =Cx-x' 
H = Hx,{MlL) = (MlL)x,A=Ax. 
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The mean, or so-called characteristic flux of a stellar population is 



J<PiJ^dJ^ ' ^^^^ 



or, if one instead uses the luminosity function $ in magnitudes,^ 



where Fvega is the flux of Vega. 

We use a luminosity function normahzed equal to 1, 



j 4>iJ^dJ^:= j ^{M)dM :=1, 



+00 

(40) 



as we obtain the amplitude of the LF from the matter density and the mass-to-light ratio of the matter components (bulge, disk) later 
on. 

The luminosity functions in the literature are usually given for stars at a distance of 10 pc. The relations for the source flux Fq at a 
distance Dos and its flux at 10 pc, or its absolute magnitude M. are given in the following two equations, allowing for extinction 
along the Une-of-sight: 

F,{M,xj,D,,) := FvegalO-O-4^ x (T) ' 10^-4-^(-'^'^»>. (42) 

4.3. The Number Density of Sources 

We characterize different source components (bulge and disk) by an index s with corresponding indices in the density, luminosity, and 
mass-function of that component. (M/L)s is the mass-to-light ratio of that component in solar units. 

The number density of sources is a function of the mass density, the mass-to-light ratio, and the characteristic flux of each component: 

r r, ^ Ps(x,y,Do,) 

Note that (M/L) is the mass-to-light ratio of the total disk or bulge component, and has to include the mass in stellar remnants or in 
gas. Therefore, the value of (M/L) is not necessarily equal to the stellar mass-to-Ught-ratio in the bulge and the disk. 
The normahzed probabihty distribution for sources PsiDos) at distance Dos is 

/•r> \ PsiDos) rAA\ 

PsiDos) • (44) 

J PsiDos) dDos 



4.4. Including the Color and Radius Information 

To use the color information, C := - A^', we construct a normahzed color-flux distribution Pcmdi-M.,C) from the color-magnitude 
diagram of stars, 

' PcmdiM,C)dMdC=l, (45) 



which is related to the luminosity function as 

^iM) = j PoUM,C)dC. (46) 
The radius is related to the luminosity and color as R^iM.,C) (see § B.4). 



' With dM = -(2.5/ln I0)dr/r the conversion of the luminosity function from flux to magnitudes becomes ^(M) = -0.4 In lOFvegalO"" '*^ (/-(fvega lO"^ ''-^) . 
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5. APPLICATIONS FOR THE MICROLENSING REGIME 

In this section we derive the basic microlensing quantities and distributions using the four-dimensional event rate differential derived 
in § 3. We apply the equations to M31 using the M31 model in § B. 

5.1. Optical Depth r 

The optical depth r is defined as the number of lenses that are closer than their own Einstein radius to a line-of-sight. The optical 
depth T is therefore the instantaneous probability of lensing taking place, given a line-of-sight and a density distribution of the lenses. 
For a given source star at distance Dos, the optical depth equals the number of lenses within the microlensing tube defined by the 
Einstein radius /?e(A^,£>oi,Z)os) (eq. [4]) along the line-of-sight: 

D„, DC DC Re 

r(Dos) = J J J J niDoi,M,Vi)x2TTrdrdvtdMdDoi 



Dos oo oo 

= / / / p(Doi)aA^Pv,(vt,Ooi) / InrdrdvtdMdDa 

(A-]\ 

= J J p(Doi)aM)TrRE^dMdDoi 



= ^ Jp{Doi)DiDoi)dDou 



with D{Da\) := Doi(Dos -Do\)/Dos, equal to Paczynski (1986, eq. (9)). Equation (47) demonstrates that the optical depth depends on 

the mass density, but not on the mass function ^(M) of the lenses. 

In the past, the optical depth along a line-of-sight to M3 1 was often calculated by setting Z)os equal to the distance to the plane of the 
disk of M31 (Gyuk & Crotts 2000; Baltz & Silk 2000). This is like treating the sources for lensing as a two dimensional distribution. 
It yields fairly adequate results for the optical depth of disk stars but cannot be justified for the bulge stars in M31. We use the source 
distance probabiUty distribution (equation (44) to obtain the line-of-sight distance-averaged optical depth: 

< r >, := / PsiDos)TiDos)dDos. (48) 



Figure 4 shows the average optical depth for the central part of M3 1 for lenses in the halo of M3 1 ("halo-lensing"), and for stellar 
lenses in the bulge and disk of M3 1 ("self-lensing"). The self-lensing optical depth is symmetric (with respect to the near and far side 
of M31) and dominates the optical depth in the central arcminute of M31. The halo-lensing optical depth is asymmetric and rises 
toward the far side of the M31 disk, since there are more halo lenses in front of the disk. 

Figure 4 (first row, left) shows the halo-disk optical depth. The results do not depend so much on the three-dimensional structure of the 
disk but much more on the halo core radius assumed. We use r^ = 2kpc (see § B). Gyuk & Crotts (2000) used core radii of Tc = 1 kpc 
and = 5 kpc for their Figures Ic and Id, and our result is in between their results, as expected. Baltz & Silk (2000) have obtained 
quahtatively similar results using rc = 5 kpc, but assuming an M3 1 distance of 725 kpc and a slightly less massive halo than we do. 
The optical depth caused by all M31 components is shown in Figure 4 (third row, left). The result of (Han 1996, see his Fig. 1) using 
a halo core radius of rc = 6.5 kpc looks strikingly different. Comparison to Figure 4 (third row, left) demonstrates that the total optical 
depth is dominated by bulge lenses in the central part of M31. The last panel of this figure shows the optical depth for bulge-lensing 
toward M3 1 sources. The bulge-lensing optical depth had been obtained by (Gyuk & Crotts 2000, see their Fig. 5), but the values that 
they obtained are up to a factor of 5 larger than ours (which probably is due to their different M31 model). 

5.2. Single-star Event Rate 

The optical depth is the probability of stars to be magnified above a threshold of 1.34 at any time. Observations, however, usually 
measure only a temporal change of magnification. Therefore, the event rate, which is the number of events per time interval, is the 
relevant quantity for observations. The event rate is the integral of equation (32) over lens masses, lens distances, relative velocities, 
and impact parameters b smaller than a threshold utRe: 

oo Dos OO Ht^E 

dDoidMdvtdb ' 



dD,dMdv,db dbdvidMdDo 





Do, 



= 2mt / ^(M) J p(Doi)Re(DouM) J ViP^,(vuDox)dv,dD^xdM 



(49) 



= 2mt^ / p(D^,)s/D(P^ JVM^(M) J v,p,Xvt,Doi)dvtdDoidM 



= : MTri(Dos)- 

This had been first evaluated [using a single mass instead of ^(M)] by Griest (1991).^'* 

Eq. (11): changing his notation with T = Tj, Vc s V2cri, L = Dos, \fm = v^, /^i+^i+^.i = p(Doi), x' s Dgi/Dos, v = vo/iVlai), u' = vt/(V2o-i): 



Ft = 4 



/Mo 

corresponds to our formula setting ^(M) = 6{M-Mq)/Mq, ijs = 
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Fig. 4. — Contours of the line-of-sight-averaged optical depth (t)^ (eq. [48]). x and y are given in the intrinsic M31 coordinate system, which is centered on the 
nucleus of M31 and where the M31 disk major axis is orientated horizontally (P.A. = 38°). Halo lensing of disk sources (first row left, a), halo-bulge lensing (first row 
right,'), halo-lensing of bulge & disk sources (second row left). The average optical depth for self-lensing of sources in M31 is shown in the second row on the right. In 
the third row, left panel, we show the resulting total optical depth with the contributions of all lenses. The third row, right panel, displays the optical depth due to bulge 
lenses. The optical depth caused by the MW (not shown), is nearly constant tmw = 0.78 x 10"^. To obtain the values of < r >s we used the model of the luminous and 
dark matter of M31 presented in § B. Here and in all following calculations a MACHO fraction in the dark halo of M31 of unity was assumed. The spacing between 
adjacent contours are shown as inserts in each diagram. The contour lines < t 2 x 10~* are shown as dashed curve. 



The impact parameter threshold uj is equivalent to a magnification threshold At. Therefore, the number of events with amplifications 
larger than At(mt) is proportional to the threshold parameter mt- 

ri(Z)os) is the event rate along a chosen line-of-sight to a distance of Dos. Analogously to the optical depth, we also define the 
line-of-sight distance-averaged single-star event rate 

<Ti>, = J p,iDos) riiDos)dDos (50) 

toward M3 1 . 

We show these line-of-sight distance-averaged event rates for the halo of M3 1 and the stellar lenses in the bulge and disk of M3 1 
(self-lensing) in Figure 5; the single-star halo-lensing event rate is evidently asymmetric, whereas the single-star self-lensing event 
rate is symmetric. The levels of the event rates (for each hne-of-sight) are of the order 10"^ events yr~' (dashed), which impUes that 
at least a few times 10'' source stars are needed to identify one lensing event (even if all lensing events below the threshold mt = 1 could 
be observed). It can also be seen in Figure 5 that only in the innermost part (r < 5') the self-lensing event rate exceeds the halo-lensing 
event rate (for a 100% MACHO halo). As mentioned earlier, the optical depth does not depend on the lens-mass distribution (for the 
same matter density) because the decrease of number of lenses with lens-mass is balanced by the increased area of the Einstein disks 
around them. However, the events take longer, since larger Einstein radii have to be crossed. For the same optical depth, this then 

—1/2 

must imply a decrease in event rate: F ^ Mq ' , setting ^(M) = 5{M - Mq) / Mq in equation (49). The decrease of the event rate with 
increasing mass of the lenses can be seen in Figure 5 (first row, right panel) and Figure 5 (second row, left panel). 
The relations above give the event rate per hne-of-sight or per star. To compare this with measurements of the lensing rate for resolved 
stars, one has to account for the source density. 
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Fig. 5. — Averaged event rate < Fi (y ') towaj"d M31 for bulge-bulge lensing yire? row, left), halo-bulge lensing with Mq = 0.5Mq (i.e., a mass function 
i(M) = S{Mq-Q.5Mq)/0.5Mq) (first row, right), halo-bulge lensing with Mq = O.IMq (second row, left), and halo-disk lensing with Mq = O.lMg (second row, 
right). Contour levels and the spacing between adjacent contours are given on top of each diagram. The dashed line marks the Kr'' events yr"' level. Whereas 
self-lensing is symmetric, halo lensing shows a clear asymmetry. The event rate shows a maximum at the far side of the M3 1 disk (negative y- values). These contours 
cannot be compared with an experiment since, first, one could certainly not identify all objects with a threshold of iij = 1 or a magnification of 1 .34 and second, one 
has to convolve the single-star event rate with the density of sources. The proper event rate maps in the pixel-lensing regime can be seen in § 7. 

5.3. Distribution for the Einstein Timescale 

Not only the number of lensing events per time and their spatial distribution but also their duration (Einstein time) is a key observable 
in microlensing surveys. The distribution of the Einstein timescales of the events is 



dtE duo 





OoDos OO 



OO Dos OO ^ 

/ / J p(Doi)aM)Pv,ivt)2v,RE6(tE-^)dvtdDoidM 

n n n ^ 







5(v 

2 J J J p(Doi)aM)py,(^t)vtRE4£7^dvtdDoidM 



OO Dos / \ 

^11 p(DodaM)pJ^)RE'dDoidM. 



(51) 



The second line of equation (51) is proportional to the equation presented in Han & Gould (1996a).' 
' ' Their eqs. (2.2.6) and (2.2.7) with /(Ie) oc p(Je), dmin = 0, dmax = £>os, and /m(M) oc ^(M), 



i-'os OO OO / 1 \ 

p(tE)<x J e(M) J p(Doi)Re(M,Doi,Dos) J npv,(vt) J g^;^,'_ ^E(M0,Doi,Dos) ^ s(tE-(^^^ ' tAdtE'dvtdDoidM. 

n an \ / 
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Fig. 6. — Line-of-sight distance-averaged distribution of the event rate with Einstein timescale {clF/dtE), (yr ) using the model of M31 presented in § B and 
assuming a 100% MACHO halo. Results are shown for two positions in the intrinsic M31 coordinate system (see Figure 4), at (jc,y) = (l',0') (red), and at {x,y) = 
(4. 46'. 4. 46') (con'esponding to (x,y) = (Ikpc, Ikpc), blue). The bulge-bulge Einstein time distribution is shown as solid line. The halo-bulge distributions have 
been evaluated for a MACHO mass of Mq = 0.1 Mq (dotted line) and Mq = 0.5 Mq (dot-dashed line). The halo-disk lensing case is shown for a MACHO mass 
of Mq = 0.1 Mq (dashed line). The Einstein time distributions of the event rate differ considerably for halo-disk, halo-bulge and bulge-bulge lensing and also vary 
significantly with the line-of-sight position. For comparison we also plot the Einstein timescale distribution for M31 halo-lensing derived by Han & Gould (1996a) up 
to a pre-factor (that we chose equal to 3 X 10"*) as a black solid curve. Han & Gould (1996a) considered the distributions for the halo-disk and halo-bulge lensing to be 
similar and not distinguish between them further. They used a MACHO mass of Mq = 0. 1 Mq for their curve. However, it looks more similar to our halo-bulge curve 
for Mo = 0.5 Mq , and cannot be moved on the halo-bulge or halo-disk for Mq = 0. 1 Mq curve with another choice of the prefactor. 

The result is of course independent of the relative impact parameter mq. If one carries out an (microlensing) experiment with a 
threshold uj, one obtains with equation (51) the Einstein timescale distribution of events as 



oo Dos 





(52) 



This result corresponds to that of Roulet & Mollerach (1997)'^ and Baltz & Silk (2000)'^. The (normaUzed) probability distribution 
for the Einstein timescales becomes 

1 dTj 

P(tE) ■= 



Ft dtE 

With this probabiUty distribution the average timescale Je of an event with line-of-sight distance D^s can be obtained: 

oo oo A 



(53) 



UDos) := / tEp(tE)dtE = ^ J J J p{D,,)i{M)pJ^)tE^dD,,dMdtE 

\ E / E 

oo OO i)os OO Z)o>i 

= |rr/// PiDox)£.{M)p,XvdRE^dDadMdv,= J p{Da)m)'^RE^ dD^^dM = If^^, 







(54) 



which equals the result of Alcock et al. (1995).'"* 

We instead aim for the line-of-sight distance-averaged mean Einstein timescale [at an arbitrary position {x,y)\. We start from the 
line-of-sight distance-averaged event rate per Einstein time t^,. 



dVT \ f dVT 
-r^) = pAD,,)-7^dD,,. 
dtE , J dtE 



(55) 



Figure 6 shows examples for this line-of-sight distance-averaged distribution {dVj/dtE) ^ for two different positions in the intrinsic 
M31 coordinate system (see Fig. 4), at (x,y) = (l',0') and ix,y) = (4.46', 4.46') = (Ikpc, Ikpc). The distributions show a strong 
dependence on the line-of-sight position. The halo-bulge and halo-disk lensing timescales are longer than those of bulge-bulge 
lensing. An increase in MACHO mass decreases the event rate (see Fig. 5), and the timescale of the events becomes longer (see the 
examples for Mq = 0.1 M0 andMo = O.5M0 in Fig. 6). 

Weighting Je with this function and integrating over all timescales finally yields the desired mean line-of-sight distance-averaged 
Einstein timescale of an event: 



oo , , 



(56) 



Their eq. (31) corresponds to our formula conveiling their notation to ours F = F/ht. dn/dm = p(Da[)^(M), 7" = (e, i'^ = I't, J^^ d'-fv-^ '^(''dis)'^(^'dis) = Pi'i(^'t)- 
Their eq. (9) corresponds to our formula converting their notation to ours L = Dos, 'e = 2rE. fh = «T. I'c = V2a\, x = D„]/Dos, rj = \'o/V2o"i), v = \\/(\/2ai) 
and setting g(M) = (5(M-Mo)/M, CTs = 0. 
Their eq. (2) with t = 2t^ . 
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Fig. 7. — Einstein timescale averaged over all sources (/b), calculated for lines of sights toward the center of M31 using the model of M31 presented in § B. We 
show the distributions for bulge-bulge self-lensing (first row, left), halo-bulge lensing for lenses of Mo = 0.1 Mq [i.e., a mass function ^(M) = <5(Mo -0.1Mq)/0.1Mq] 
(first row, right), halo-bulge lensing for lenses of Mq = 0.5 Mq (second row, left), and halo-disk lensing with Mq = 0.1 Mq (second row, right). Contour levels and 
spacing are shown on top of each diagram. We assume the MACHO fraction in the dark halo to be unity. The dashed line marks the lowest contour level. 



Mean Einstein timescales (fe), are shown for lensing and self-lensing in Figure 7. Generally, the minimum of (fn), is near the M31 
center, irrespective of the lens-source configuration. The mean Einstein timescale is smaller for lower MACHO masses, since the 
Einstein radii become smaller and are faster to cross (compare the two middle panels in Fig. 7). The bulge-bulge lensing events (first 
panel) are the shortest. This is caused by the small lens-source distances, which reduce the sizes of the Einstein radii. 



5.4. The Amplification Distribution 



The magnification distribution of the event rate is 



dT 


duo 


dV 


duo 


r 


dAo 


dAo 


duo 


dAo 





Inserting equation (8) makes the result equal to that of Griest (1991).'^ 
Transforming equation (51) we can write 



d^r _ 2 
dtndAo tiB^ 



duo 



OO Dos 



dAo 



p{Dox)m)P 



(57) 



v,( ^ ]RE'dDoidM. 



(58) 



using duo/dAo from equation (8). 

15 Eq. (22): with A = Ao, F' = i , = -i^ [AoCAo^ - ir'/^ - ir'l\A^^ - \)-^l\ 
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5.5. The Distribution for the FWHM Timescale 

Although the Einstein timescale ?e contains all the relevant physical properties (mass, position, and velocity) of the lens, it is of limited 
practical use in the case of an ill-determined source flux ("Einstein time - magnification degeneracy", see § 2.2). In this case fpwHM 
is the only properly measurable timescale of a light curve. We obtain the distribution function for fpwHM (neglecting finite-source 
effects), starting from § 3, using ?FWHM(vt,M,Z)oi,Mo) = ^'^^*^'^°'^ w(mo) (see eq. [9]): 



OO Dos CO 



dtFv/HM duo 



J J J p(Doi)aM)py,(vt)2viREs(tpwHM-^w(uo))dvtdDoidM 

^ ^ 

OO Dot, OO ^ J '^E ( '\\ 

2 J J J p(Doi)^(A/)Pv.(vt) Vt/?E ^Vt^ 7° dv.dDoidM 

\-7;t«("o>\ 

OO OO , 

J J p(D,i)^(M)wHuo)RE^pJ^wiuo))dDoidM 



^FWHM 



_ 2wHuo) 

fpWHM^ 





OO 



(59) 



OO Do& , V 

/ i(M) J p(Doi) ^ w(uo) Re^ dDoi dM. 



Baltz & SiUc (2000) expressed the same relation in an alternative way^^ and already motivated the same change of variables from Vt to 
?FWHM- Our relation for the FWHM time distribution of the event rate in equation (59) does not include any derivative or inversion of 
w(uo) and thus is very easy to evaluate numerically. Note that one can use w(uq) ~ \/l2 "o as high-magnification approximation. 
Replacing the relative impact parameter mq by the maximum amplification Aq (using eqs. [7] and [8]) yields an equivalent description 
of this result: 



d^T _ 2^!(Ao) 

dtp^ffUMdAo ?FWHM^ 



OO D„ 

I m) I piDoi)pJ-^TiAo)]RE'dDoidM, 

J J V'FWHM / 



with /?e(M,Doi, Dos) and *(Ao) as 



*(Ao) := 



duo 



dAn 



T\A = ^^/o [-^o + (-^o'-l)'^']'^' [(Ao+l)3/2-Ao(Ao + 3)'/^] Aq^i 12_ 



(Ao'-l)V4(Ao + 3)i/2 



(60) 



(61) 



where 



duo 
dAo 



d'r 



, at the position (x,y) = (l',0') in the intrinsic M31 coordinate system 



was defined in equation (8). 

Figure 8 shows the distribution of events \ -n — r, TrjTrri — t- 

(see Fig. 4), i.e., on the disk major axis.'^ 

Small amplifications are favored, which implies a strong dependency of the total number of events on the experimental limit of Aq 
(e.g.. At). Figure 8 can be compared with sensitivity regions of current experimental setups for microlensing experiments toward M3 1 . 
As these are usually only sensitive to ?fwhm of larger than 1 day, it is extremely unlikely to detect maximum magnifications larger 
than lO"'. These high-magnification events can only be routinely detected with combined observations from several sights located on 
different longitudes, with large telescopes allowing short integration times, or from space. Note that recently after an alert detection 
and intensive follow-up monitoring, Dong et al. (2006) could measure a lensing time-scale of ?fwhm ~ 0.05 d and a magnification of 
the order 3000. 



6. APPLICATIONS FOR THE PIXEL-LENSING REGIME 

The microlensing parameters (Fq, ?e. and mq) are not directly observable anymore in crowded or unresolved stellar fields. In that case, 
the two measurable quantities are the full- width timescale ?fwhm and the difference flux of an event. 

We now make use of the luminosity function $(A^), the source number density ns(x,y,Dos), and the color distribution pcmd(.M,C) of 
the source stars introduced in § 4 and derive the event rate distribution function ■ This quantity can then be Unked to the 

measured distributions most straightforwardly.'** 

In the first two subsections (§§ 6.1 and 6.2) we derive the required distributions neglecting finite source effects. However, the high 
magnifications needed to boost MS stars to large flux excesses go in parallel with finite source effects that make these large flux 
excesses hardly possible. We show this in detail in § 6.3, where we incorporate finite source effects in the calculations. 

16 We can derive their expression in eq. (10) with ri = -3— , v = —S^ ,M = Mn: 

v2<Ti V2cti 

Si^ =/// / vanoi H^FWHM-af v.(«o)) dbdv,dD,,dM 

= Tj7p(DoiK(M)v^Mvt) {^'[^-'{'^)] y e(2^wM-vt)dvtdD„,dM 

using i(M) = S(M-Mo)/Mo. 

We have now changed to logarittimic units for timescale and magnification, and also converted the probability density according to that. 

Note that this distribution function is different from dV/dwF = dV/diAp ?fwhm) derived by Baltz & Silk (2000) for the flux-weighted timescale wf := Af tFWHM- 





-3 a -1 1 

logio twin ' Id) 



-3 -Z -1 1 
logio thwti^n / [dl 



-3 -a -1 1 



Fig. 8. — Distribution (^d^T / (rflog?FWHMi^logj4o))j (y ') (i e., rate per year and line-of-siglit) for different lens-source configurations, calculated at the position 
(x,y) = (l',0') in the intrinsic M31 coordinate system (see Fig. 4). The values of timescale and magnification are largely confined to a linear region within the 
logarithmic timescale-magnification plane. First row, left panel: Bulge-bulge lensing; middle panel: disk-bulge lensing; right panel: MW-halo-bulge lensing for MW 
lenses with Mq = O.IMq. Second row, left panel: Halo-bulge lensing with Mq = IOOOMq; middle panel: halo-bulge lensing with Mq = O.SMq; right panel: halo- 
bulge lensing with Mq = O.IMq. The dashed contour level line marks the 10"** (y"') level in each diagram. The number of events rises toward smaller amplifications 
and larger timescales. Small ampHfications are strongly favored for all lens-source configurations. The number of high-amphfication events rises for smaller fpwHM 
timescale. Detecting events toward M31 with maximum amplifications of Aq > lO' therefore requires an experiment that is particularly sensitive to short timescale 
events. The dashed blue line shows for each selected timescale the amplification where the distribution is maximal [91og(Ao) = 0]. 



6.1. Changing Variables ofT to TpwHivi <^nd Af 
6.1.1. Event Rate per Star with Absolute Magnitude M 
We now use the relations Re{M,Do\,Dos), Ao(Fo,Ap), fFWHM(t^'t,^E,^), "^[AoiFojAf)], and T[Ao(fo, A/r)] from § 2 and the equations 
fFWHM(v„/?E,f;),Af)= fET(Ao), AF(Fo,b,RE) = Fo [A(b/RE)-l] and =-8|^M,r^ , and we obtain the event rate per 

FWHM time, per flux excess, per lens mass and per source star with an absolute magnitude A4: 



dtpWHM dAf dMdJV[ 



= ^(-^) / / I mJhwrt^i^F-F^ [a (i) - 1] }'5{rFWHM-f w}d^,Jv.JD, 





Dos oo oo 



_ ...-os^^ 5|fo-«E[2A(,(A(,2-ir'/--2l''''| , . 

*// ! PiPvM- PSFI ^-5[tE^^yi-^L0)dbdv,dD^x 

I ■"' I ^ ' 

Dm ^ / \ 

= ^H7^J I P^P^'' ^t-^E <5(fFWHM - ^ T dv,dD,i 

n ° / « A (62) 

Dos oo 51 r ^ Tl 

I vC- I 

= 2$i^T2— [ p(^p (^T\RE^dD„\ 

Fa dAa Tfwhm- J V 'fwhm I 

^ ^ 

= = ^(MKiM) I / p(D„,)^E^ A. t) dD^,. 

^ ^ 

using the luminosity function in magnitudes <b(M ) and the conversion from absolute magnitudes to intrinsic source fluxes F^)(A4, Dos) 
(eq. [42]). Equation (62) is the transformation of equation (32) to the observables relevant in the pixel-lensing regime. It gives the 
event rate per star with absolute magnitude A4 and will be converted to the event rate per area using the density of stars below. For 
the special case of highly amplified events, (Aq ^1), the approximations w 12(Fq/ Ap)'^ and T w ^/T2(Fq/ Ap) can be inserted into 
equation (62). 
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Fig. 9. — Event rate per area, per FWHM time and per flux excess, d'^F /{dx dy dlogtjivmM ''log ^f). obtained from equation (63) by mass and magnitude integration. 
Tlie contours are shown in units of y"' arcmin"^, timescales and flux excess liave been inserted in units of days and Jy. The equations are evaluated at (x.y) = (l',0') 
in the intrinsic M31 coordinate system (see Figure 4), i.e., at a distance of 1' along the disk major axis. We show bulge-bulge lensing (left), halo-bulge lensing with 
Mo = 0.1 Mq lenses, and bulge-disk lensing (right). The contour levels are given as inserts in each diagram. The red dashed line marks the 10"' y"' arcmin"^ level 
in each diagram, brighter areas correspond to higher values. The double-wave shape of the contours with bulge stars as sources is caused by the shape of the PMS 
luminosity function of the bulge sources (see Figure 10). For the results shown in these panels all sources have been treated as point sources. For finite source effects, 
see § 6.3. 

0.1.2. Event Rate per Area 

All previously derived event rates are per star, or per star with a given absolute magnitude Ai. Observed, however, are event rates per 
area. These are obtained from the source density distribution along the line-of-sight nj(x,3',Dos) and equation (62): 

oo 

d'r,,, 2$,(X)e;(M) f ^ ^ X* /■ ^ "^eT 



dx dy c/f FWHM A/7 dM dAi 



fpWHM^ 



vq dDoidDo 



(63) 





where the quantities in the integral have the following functional dependences 
'^(Ao{F(),Af)), T(A()(F(),Af)), V(){x,y,Doi,Dos)- Equation (63) is the event rate per interval of lens plane area, FWHM time flux 
excess, lens mass and absolute magnitude of the lensed star. For highly amplified events one can replace ^I^ and T in the integral by 



\ fpWHM 

Fo(7W,Dos), Pi(x,y,D,i), RE(DohM,D,,) 



l2{Fo/AfT and V12(Fo/Af), respectively. 

Different lens (disk, bulge, or halo) and source (disk or bulge) populations are characterized by an index I and s in equation (63). For 
the total event rate Ftot one has to sum up the contributions of all lens-source configurations: 

dxdydtp^f/uMdAfdMdM ' ^ dxdydtf^iiudAfdMdM' 



(64) 



The event rate per area is then obtained by multiplying equation (64) with the efficiency e(.x,y, A;7,fpwHM) of the experiment and 
integrating over all lens masses and source magnitudes, and the timescale and flux excess. The probability that one can observe two 
stars lensed at the same time at the same position is practically zero, since J^^ ^(d^Tiot/dxdy) dxdy 1 holds. 

We carry out mass and magnitude integration of equation (63) for the position {x,y) = (l',0') in the intrinsic M31 coordinate system 
(see Figure 4), i.e., at a distance of 0.22 kpc along the disk major axis and show the results for bulge-bulge, halo-bulge, and bulge-disk 
lensing in Figure 9. Compared to Figure 8 the contours are smeared out in the AF-direction, since they come from convolving those 
in Figure 8 with the source luminosity function. 

In Figure 10 we demonstrate for the halo-bulge lensing case in Figure 9 that the "double-wave shape" in the contours in the two left 
panels of Figure 9 indeed is caused by the luminosity function of the PMS stars. We split the source stars into post-main-sequence 
(PMS) and main-sequence (MS) stars and plot the corresponding contours into the middle and right panels of that figure. The double- 
wave shape appears only in the PMS figure. Besides that, it becomes obvious that PMS stars cannot be lensed into events with short 
timescales and small flux excess. This is because the faintest PMS stars in the M31 bulge have an unamplified flux of 8 x 10""^ Jy and 
thus need an amplification of only a factor of 2 to yield a flux excess of A/r w 10"^ Jy. Magnifications that small are incompatible with 
short timescales according to Figure 8. In contrast, MS stars need very high amplifications to reach a flux excess comparable to that 
typical for PMS stars. According to the right panel in Figure 10, ultra-short, large excess flux events with MS source stars would be 
more common [compare, e.g., the contour levels at log(fFWHM/[d]) = -3 and log(AF/[Jy]) = -4] than events with PMS source stars. 

6.2. Including Color Information in the Event Rate 

The color of a point source remains unchanged during a lensing event, since the lensing amplification does not depend on the frequency 
of the source light. In practice, microlensing events with blending by nearby stars, and any event with finite source signatures may 
show chromaticity in the light curve (see e.g. Valls-Gabaud (1995); Witt (1995); Han et al. (2000)). The difference imaging technique 
eliminates all blended light from the lensing light curve. For lensing events without finite source effects the color of the event therefore 
equals that of the source and can be used to constrain the source-star luminosities. 
Replacing ^{M) with /:'cmd(A^,C), and dM with dM dC (see § 4.4), we obtain 



dx dy t/fFWHM dAf dM dM dC rpwHM " 



■Pcn,dAM,C)^,(M) 



ns(x,y,Dos) — 
Fq 



Pi{D^\)RepJ 



ReJ_ 

VfpWHM 



,vo dDoidDo 



(65) 
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Fig. 10. — Event rate per area, per FWHM time and per flux excess, d'^V jdx dy JlogfpwHM dXogHf), for halo-bulge lensing with Mq = O.IMq calculated at the 
position (-V,)') = (l,0)arcmin. Left panel: Distribution for PMS and MS sources. Middle panel: Distribution for PMS sources alone. Right panel: Distribution for MS 
sources alone. The contour levels are shown as inserts in each diagram. The red line marks the 10"^ y"' arcmin"" level in each diagram. The double -wave shape of the 
PMS distribution is due to the two peaks in the PMS luminosity function of the bulge sources (see Figure 25 in § B). For the results shown in these panels all sources 
have been treated as point sources. 



We derive lens mass estimates starting from equation (65) in § 8. We also demonstrate there that including the color information leads 
to considerably smaller allowed lens mass intervals than for the case in which color information is ignored (i.e., the case in which lens 
mass probability functions are derived from eq. [63]). 

Equation (65) allows to reconstruct the mass function of the lenses and the MACHO fraction in the dark halo (see de Rujula et al. 
(1991); Jetzer & Masso (1994); Jetzer (1994); Mao & Paczyhski (1996); Han & Gould (1996b); Gould (1996a)). In this way one can 
obtain the optimal parameterization for the mass function ^/(M) using a maximum-likelihood analysis for a set of measured lensing 
events. If the ingredients for the kernel (i.e., all but the pre-factor ^/(M) in eq. [65]) are accurately provided by theory and the number 
of lensing events is large, then the mass distribution can be derived solving the Fredholm integral equation of the first kind. Inversely, 
a certain ensemble of lenses allows conclusions on the based distribution functions. 



6.3. Event Rate Taking into Account Finite Source Effects 

As described in § 2.1 the point-source approximation is no longer valid, if the impact parameter uq is smaller than Mq, i.e., half the 
source radius projected onto the lens plane (equation (14)). In this case, the maximum amplification and thus the flux excess stays 
below the value for the point-source approximation, and timescales of events are enlarged (see Eqs. 16 and 13). Baltz & Silk (2000) 
already accounted for the upper limit in magnification and obtained the correct value for the total number of events (i.e., events 
with and without finite source signatures) as a function of magnification threshold. Their approximation, however, is limited to high 
amplifications and ignores the change of magnification and event timescale.'^ Thus, the flux excess and timescale distributions of the 
events are not predicted accurately. 

We have shown in § 2.1 that finite source effects are likely already for small maximal magnifications and that the timescale changes 
due to finite source effects can be large. Therefore, we derive precise relations and account for the finite source sizes as follows: 

1. Events with uq > Uq, i.e., those for which the finite source sizes are irrelevant, are treated as before; we redo all calculations 
starting from equation (31), and if the impact parameter b is involved in an integral we multiply the integrand with 9{b — R^u'!^y, the 
step function allows only contributions in the integrand, '\fb> ReUq holds. To see how this transports into the c/Doi-integration if the 
variables are changed from b and Vt to fpwHM and AF in the Eqs. 62, 63 and 65 



l{b-REU*o)=9{A*-Ao). 



where we are using the following relations: 
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" Baltz & Silk's (2000) eq. (26) with eqs. (20) and (22) can be written in our notation as (see footnote 16) 
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Multiplying the integrand of equations (62), (63), and (65) with equation (66) extracts only those light curves, where finite-source 
effects can be neglected. 



2. For events where the finite source sizes are relevant, i.e., events with uq < Uq, we use the approximations for the maximum 
ampUfication and the FWHM time given in equations (15) and (16). This means that we just replace the relations for the impact 
parameter and the maximum magnification and the FWHM timescale relations of events by equations (13) and (16) when switching 
from the point source to the finite source regime. We then can derive the equations for the event rates with finite source effects 
from equation (32) analogously to the point-source approximation, but this time with a step function of 9{R^ul-^-b) in the integrands 
allowing only small impact parameters. WithAQ(Doi,Z)os,^*,M), Uq{Do\,Dos,R*,M) and Vt = (/^eAfwhm) T*(Aq,mo) and its derivative 
\dtT:mmivt)/dvt\ = (/?E/vt^) T* we obtain 



dxdydtpwHMdApdMdM 
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(67) 
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dDUApM 



dAp 



= 2Cd:PJ{Fq + Af) (Cd- Dos + Af{2FQ + Ap)) ~ ICo'^Dos^Af (C/,. Dos + Af 



Alternatively a transformation inverting /fwhm(^) is possible.^*^ We use the values for the source radius, luminosity, and color relations 
R^,(M,C) summarized in Appendix B (§§ B.3 and B.4). 

Figures 11 and 12 show contours of the event rate per fpwHM timescale and flux excess, per year and square arcminute, with finite 
source effects taken into account. We use the same position as before, at (x,y) = (l',0') in the intrinsic M31 coordinate system (see 
Figure 4) or at a disk major axis distance of 1'. The upper panels show the distribution for light curves showing no finite source 
effects (eq. [63] with eq. [66]), whereas the lower panels show the distribution obtained from mass and source luminosity integration 
of equation (67), i.e., for light curves affected by finite source effects. 

The black areas indicate the event parameter space, which is not available to source stars once their real sizes are taken into account: as 
finite source effects mainly occur at large amplifications, large Af and small fpwHM values are suppressed. Events in the point-source 
approximation, which fall into the black areas in the upper panels of Figures 11 and 12, end up with longer timescales and lower 
excess fluxes (lower panels) if the sources sizes are taken into account. The sharp cutoff at large flux excesses arises, since there is 
an upper limit in Af depending on source luminosity and size (see equation (18)) and since the luminosity function of the stars has 
a steep cutoff at giant luminosities of Mr = -0.83mag (bulge) and Mr = -2.23 mag (disk). The maxima with vertical contours for 
finite source effects in the lower panels come from shifting events for which the point-source approximation "just" fails at longer times 
scales (see eq. [16]). Light curves with finite source effects have (depending on their flux excesses) most likely FWHM timescales of 
about O.Oldays, or 15 minutes, and the sources lensed with that timescales are MS stars. The secondary maxima around 1 day and 
flux excesses of 5 x 10~* to 2 x 10"^ Jy for bulge-bulge, disk-bulge, and 0. 1 Mq halo-bulge lensing, and of about 10"^ Jy for 1000 
halo-bulge lensing, are due to lensing of PMS stars. 

In general, the ratio of lensing events with and without finite source signatures is minute for fpwHM ^ 0.5d and AF < 10~^'Jy, and 
raises to about an order of unity for bright lensing events with AF{) w 1.6 x 10"^ Jy (corresponding to a magnitude of the excess 
flux of niR = 20.7 mag) for bulge-bulge-lensing and AFq w 5 x 10"^ (niR = 19.5 mag) for halo-bulge lensing with O.IMq lenses. We 
compare the first column, bulge-bulge lensing, with results for the same lens-source configuration in Figure 9, which had been obtained 
assuming the full validity of the point-source approximation. The ratio of these contours is shown in Figure 13. The parameter space 
of interest for current surveys are flux excesses > 10"^ Jy (excess magnitude of niR = 21.2mag) and timescales between 1 and 200 
days. One can see that the true event rate can differ strongly from that for the point-source approximation depending on the flux excess 
limit of the survey. The brightest events are preferentially suppressed. This means that taking into account the source sizes is essential 
for predicting the correct number of lensing events. 
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Fig. 11. — (fiV / (rflog(FWHM d\og/S.F) [arcmin^^ y"'] at (jc,y) = (l',0') in the log(Af )-log(iFWHM) plane, for bulge-bulge {leji), disk-bulge {middle), and halo-bulge 
{right) lensing with 0.1 Mq MACHOs (columns 1-3). The upper panels show the distribution for light curves not affected by the finite source sizes. The contours have 
been obtained from inserting eq. (63) into eq. (66) and carrying out the mass and source luminosity integral. The lower panels show the distribution for Ught curves 
with finite source signatures (mass and source star luminosity integral of eq. (67). The contour levels can be read off from the inserts in each diagram. The dashed line 
marks the 10"^ arcmin"^ y"' level, areas with brighter colors correspond to higher contour values. Taking into account the finite source sizes implies an upper limit for 
Af and a lower limit for fpwHM for all light curves, i.e., for Ught curves with and with out finite source signatures (see text). For the source-lens configurations shown 
here there are no lensing eventsHght curves with excess fluxes Af > 5 X 10"'* Jy. The results shown here have been obtained by taking into account the source sizes of 
lensed stars. 

Furthermore, one has to be aware that a fair fraction of the brightest lensing events show finite source signatures in their light curves 
and might be missed when using event filters with a classical lensing event shape in a stringent way. For the detection of finite-source 
events or even of binary lensing events less stringent thresholds or modified filters are needed, which, however, enhance the risk of a 
mismatch with variable source detections. 

Finally, Figure 12 compares halos with different MACHO masses in its first and second row. An increase in MACHO mass dra- 
matically reduces the event rate and increases the event timescales. This explains the shift in the contours toward longer timescales 
(compare the change of the Ao-fFWHM contours in Fig. 8) and the decrease in the contour levels. For larger MACHO masses, Einstein 
radii do increase, and one expects finite source effects to become less important: the largest possible flux excess A/^ ^ax for the lensing 
events indeed increases; the size of the shift is as expected, since the maximum flux excess is proportional to the square root of the 
MACHO mass according to equation (8). The contours in the last row of Figure 12 show MW-halo lensing with Q.IMq MACHOs. 
Finite source effects are unimportant. Figures 1 1 and 12 make it obvious that lensing events above the maximum flux excess predicted 
for self-lensing would be a clear hint for either massive MACHOs in M3 1 or MACHOs with unconstrained masses in the Milky Way. 
Figure 14 shows the distribution for bulge-bulge lensing splits in color space. The selected color intervals are 0.0 < R-I < 0.5, 
0.5 < R-I < 1.0, and 1.5 < R-I < 2.0. In the bluest color interval (first column) we find MS stars close to the MS turnoff as well 
as SGB, red clump and some RGB stars. The medium red sample contains MS, RGB. and AGB stars, and the reddest sample (last 
row) contains stars in the RGB and AGB phase and no MS stars. As expected, the timescale of the most likely finite source lensing 
events changes with color: for the bluest color interval MS stars are responsible for the most likely finite source signature events 
and the event timescales are very short. The secondary maximum is caused by red clump and SGB stars, which are brighter, need 
less magnification and therefore have longer event timescales. The color interval of 0.5 < R-I < 1.0 contains the central part of the 
MS, and RGB and AGB stars. The MS stars are fainter (in R) and have smaller radii than those in the blue sample, and therefore, 
the maximally probable event caused by the MS stars is at lower flux excess and timescale than that for the bluer sample. The PMS 
stars are brighter (which enhances the possibility of longer timescale events) and have larger radii (which leads to stronger peak-flux 
depression by finite source sizes) than in the bluer sample, and therefore, the events have similar brightness but take longer on average. 
The reddest color interval, 1.5 < R-I < 2.0 contains the reddest PMS stars and no MS stars. These PMS stars are fainter (in R) and 
have larger radii than those contained in the 0.5 < R-I < 1.0 sample, and therefore suffer most strongly from finite source effects 
causing events with even longer timescales than for the bluer PMS stars. 
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Fig. 12. — Same as Fig. 1 1 for halo-bulge lensing with larger MACHO masses (0.5 A/q and IOOOM0 MACHOs in the first and second column) and for Milky Way 
halo - M31 bulge lensing (for 0.1 Mq MACHOs in the third column) As before, the dashed line marks the 10"^ arcmin"- y"' level in each diagram, and areas with 
brighter colors correspond to contour higher values. The results shown here have been obtained by taking into account the source sizes of lensed stars. 



bulge-bulge halo(Mo=0.1)-bulge 




logio( tfwhm / [d] ) logio(tfwhm/[cl] ) 

Fig. 13. — Ratio of the event rate distribution for extended sources counting events not showing finite source signatures (equation (66) and Fig- 
ure 11) to the event rate assuming pure point sources (equation (63) and Figure 9), as a function of flux excess and timescale of the events: 

[d2r/(rflogfFWHMrflogAf)]^_^,|^^^^^^^^/[j2r/(dlogrFWHMrflogAf)]p^.^i^_^^^^^^ for bulge-bulge (left) and O.IM© halo-bulge (right) lensing at (x,y) = (l',0') in 

the log( A;r)-log(?FWHM) plane. The dashed red line marks a ratio of 0.999. The finite source sizes cause a strong suppression of the brightest lensing events relative to 
the point-source predictions. 

7. APPLICATION TO EXPERIMENTS: TOTAL EVENT RATES AND LUMINOSITY FUNCTION OF LENSED STARS 

We now apply our results from §§ 5 and 6 to difference imaging surveys. The goal of this section is to predict realistic event rates 
that take into account observational constraints (like timescales of events and the signal-to-noise ratios of the light curves, e.g., at 
maximum). These event rates can be taken for survey preparations or for a first-order comparison of survey results with theoretical 
models. Exact survey predictions and quantitative comparisons with models can be obtained with numerical simulations of the survey 
efficiency. 
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the distribution for bulge sources within different color intervals, (S— /) £ [0,0.5] mag (first column), (R-I) S [0.5, 1.0] mag (second column), and (R—I) 6 [1.5,2.0] mag 
(third column). The upper panels show the distributions for Ught curves showing no finite source effects, whereas the middle panels show the distributions for light 
curves with finite source signatures. The lower panels show the sum of both distributions. The contour levels are shown as inserts in each diagram. The dashed line 
marks the 10"^ arcmin"^ y"' level in each diagram, brighter areas correspond to higher values. See text for more details. 



7. 1 . "Peak-Threshold" for Event Detection 
In order to identify a variable object at position ix,y), its excess flux Af has to exceed the rms flux aF(x,y) by a certain factor Q: 

Ar(^,3') = Q<JF(x,y). (68) 

The parameter Q characterizes the significance of the amplitude of a lensing event, but not of the event itself, since that also depends 
on the timescale (and the sampling) of the event. We will call events characterized by the signal-to-noise ratio at maximum light 
"peak-threshold-events" in the following (Baltz & SiUc 2000). Considering only the maximum flux excess of an event (and not its 
timescale) of course can lead to an over-prediction of lensing events, since events might be too fast to be detected. In addition, long 
timescale events with low excess flux can have many data points with low significance for the excess flux, which all together make 
a significant lensing candidate. The detectability of events therefore depends on both its amplitude (flux excess at maximum) and its 
timescale. This is the reason, why we derived the contribution to the event rate as a function of flux excess and FWH]VI timescale in 
§6. 

The flux excess threshold that a source with intrinsic flux Fo{A4,Dos,A) must achieve in order to be identified as an event can be 
translated to thresholds in maximum magnification and relative impact parameter using equations (1) and (7): 



Aj(x,y,Fo(M,D,,,A)) := Ar(^,}')^b"' + 1 



Ar(^,3')^b- 



UT(x,y,Fo{M,Dos,A)) := 



2Ai 



1 



(69) 



(70) 



in both cases we have also given the high-magnification approximations in the last step. 

In contrast to the microlensing regime (where uj is assumed to be constant), uj depends on the local noise value via A™"(x,y) 
and the luminosity A4 of the source star being lensed. In Figure 15 we show contours of the minimum magnification required to 
observe an event at a distance of Dos = 770kpc, source luminosity of Mr = Omag and a signal-to-noise threshold of 2 = 10 for a 
survey like WeCAPP in the R band. Since the M3 1 surface brightness and thus also the rms photon noise increases toward the center, 
magnifications of 50 or larger are needed in the central part. The 1VI3 1 rms photon noise and rms flux within a PSF in the R band had 
been estimated using equations (72) and (73) below. 

To obtain an upper limit for the event rate, we assume that all events with flux excesses above the peak-threshold can be identified, 
irrespective of their timescales. In previous event rate estimates the fFWHM timescales have only been considered correctly in IVlonte- 
Carlo simulations. Ignoring the event timescales in analytical estimates the event rate predictions are much more alike the upper limit 
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we present here (eq. [71]). In this case one can simply use the transformation from minimum flux excess at maximum magnification to 
the threshold relative impact parameter uj in equations (69) and (70) and integrate equation (33) over mass, lens distance, and relative 
velocities, multiplying it with the relative impact parameter threshold UTix,y,Fo(M,Dos,A)) and the number density of sources with 
brightness M, nj(x,y,Dos) ^s(-A4), and finally integrate along the Une-of-sight and source luminosity, (§ 4.2): 



2 OOOO ooDcsOO UT(x,y,Fo(M,D„s,Ay)RE 





dDoidMdvidb 



db dvt dMdDoi dM dD^ 



= J fn,ix,y,Dos)^siM)uTix,y,FoiM,Dos,A))TijiDo,)dMdDo 



(71) 







In this equation, the subscript "s" indicates the different stellar populations (bulge, disc) and their sum yields the upper limit for 
the total event rate. This upper limit for the event rate can therefore be also obtained as a product of the single-star event rate Fi ; 
(equation (49)), [using UT{x,y,Fo{M,Dos,A))] and the number density of sources with luminosity A4 on the line of sight. Equation 
(71) is similar to the equations of Han (1996, eq. (2.5)) and Han & Gould (1996a, eq. (2.2.1)).^' 

Up to now we have not discussed the value of (Tf(x,y), i.e., the value of the rms flux that appears in the equations for the detection 
thresholds. This value can in principle be taken from the error propagation in the reduction process. Due to varying observing 
conditions (seeing, exposure time of the co-added images per night), the errors can differ from day to day by a factor of up to 10; 
to obtain predictions for the most typical situation, one therefore should use the median error at each image position of a survey to 
predict the rms flux aF(x,y). 

Riffeser et al. (2001) showed that using our reduction pipeline (that propagates true errors through all reduction steps) errors in the 
light curves are dominated by the photon-noise contribution of the background light. Therefore the typical error can be estimated from 
the surface brightness^"" profile iJ,ix,y) of M31 and the typical, i.e., median, observing conditions of the survey. Using analytically 
predicted rms-values, one can study the impact of the observing conditions on event rates and optimize survey strategies. 
To measure the variability of objects one has to perform (psf-)photometry defining the angular area of the psf ilpst ;= n9^^f/ln4 and 

the FWHM of the psf ^psf. For a given experimental setup the rms photon noise crphotonix,y) within an area fipsf [arcsec'^] at a position 

ix,y) is 



o-photon (-«,>') := [(10 



-0.4(/i(j:,y)+KAM) 



+ 10-»-4'^->)lO-°-4(-^P'fexpf^ 



psfj 



11/2 



(72) 



where Hsky [mag arcsec"^] is the sky surface brightness, fexp is the exposure time in seconds, AM is the air mass of the observation, 
ZP is the photometric zero point of the telescope camera configuration in photons per second and k is the atmospheric extinction for 
the observing site.^^ 
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Fig. 15. — Contours show the minimum magnifications Aj that stars with Mr = Omag at a distance of D, 



20 30 40 



: 770 kpc need to exceed the M31 rms flux by a factor 
contour levels are A 

marks the Aj = 25 level; the square shows the field observed by WeCAPP given in the intrinsic M3 1 coordinate system (see Fig. 4). 



of!2=(^) = 10in the R band, for an experiment (with respect to, e.g., pixel size and seeing) hke WeCAPP. The contour levels are At = 17, 18, ...,32. The dashed line 



21 With /9max(Fo,i) S UT, Fq S ri,,(£>os), J ^,^„ .)Fo, = I "^''^"^ ■ 

100 Note that we neglect the correct indizes refering to the band X and define Fq =fox, ^Vega =Iyega^<^ = ^x,^q =-^©x. Af = Apj^,M = Mx, C =Cx-x' 
IJ. = tJ.xAM/L) = {M/L)x,A = Ax. 
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We have neglected readout noise of the detector because it is negligible compared to the photon noise. 
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The rms photon noise can be translated to the rms flux (in Jy) using the flux of Vega, Fvega, and its magnitude mvega = 0: 



^Ves 



,4ZP ^psf 
exp 



CrFy.X,y) .- ^ J Q-0.4(mv=8a+KAM-ZP) '^photon -^Vega {^^ + ) lU lU ^ 



(73) 



-1/2 1/2 

The last equation shows that the rms flux within an aperture is proportional to fexp , making the signal-to-noise Q proportional to fexp , 
as expected for background noise-limited photometry of point-like objects. 

The extincted surface brightness profile iJ,ix,y) in Eqs. 72 and 73 can be taken either from very high signal-to-noise measurements 
of M31 or from analytical models that are constructed to match the observed SFB-profile and dynamics of M31. In the latter case, 
the extincted surface brightness /x(x,>')-model combines the luminous matter density psix,y,Dos) with the mass-to-Ught ratio for each 
source components (i=bulge,disk) and accounts for Galactic and intrinsic extinction A(x,y,Dos) along the line-of-sight: 

M(x,3;) =-2.51og Lega-^ E 7lO-0-^-^(^.^.^">) ^^^yoJdD^ , (74) 



where the units are magarcsec 



7.2. "Event-Threshold" for Event Detection 

Gould & Han (Gould 1996b; Han 1996; Han & Gould 1996a) introduced an "event threshold", where the detectability of events 
depends on the total excess light of the light curves. They obtained an implicit equation for the threshold mt of the relative impact 
parameter, 

C(«t) _ tcyc(rFix,yf 2 

~ ~^ T^T"; — r-jtievent > ('->) 

"T tEix,y,Dos)lTFo 

where (x,j) is the rms flux at that position and zi(x,y,Z)os) is the mean Einstein time of the events (equation (54)); ( is defined by 

J [Ait)- if dt 



C("o) := 



/ [(^Ae)^ + «o^] ^dt' 



Fo{A4,Dos,A) is the unlensed source flux and ?cyc is flie (equidistant) difference between observations. 

This equation assumes equidistant sampling of the light curves and is therefore most readily applied to space-based experiments. In 
addition, it takes into account the mean Einstein timescale of events only, although the relative impact parameter threshold depends 
on the individual timescale of the event. For realistic event rate estimates, however, one has to to take into account the timescale 
distributions, as well. 

One can in fact obtain an analog relation for flux excess and ?fwhm timescale of the events (i.e., the actual observables). 



with 



Amin2;/ Fq \ _ VT2t,y,(7F(x,y? ^ 2 t^.. 

^\ A min ' 'FWHM - : yevent 5 (.'O; 

VA^'"' J tt/fwhm 



/(Ao - 1) [12(? Afwhm)2 + 1] 

Equation (76) can be numerically inverted to obtain the peak flux threshold A"'"(fFWHM,J^,>', A^) as a function of the event timescale. 
Therefore, it is obvious that the peak threshold and event threshold criteria are related assuming equidistant sampling and that the 
event threshold criterion is a special case of the peak threshold plus a ?fwhm threshold criterion, which is evaluated in equation (77) 
(see § 13)P 

7.3. Total Event Rate with Excess Flux Threshold A™" and Timescale f™HM Threshold 

The upper limit derived in § 7.1 still includes numerous events that cannot be detected in finite time resolution experiments. At this 
point, where not only must the flux excess (maximum magnification or relative impact parameter) of the event be considered, but also 
the timescale of the event, the transformation of the event rate from the "theoretical quantities" to the "observational quantities" in § 6 
becomes most relevant. Using equation (63) we can simply integrate from the lower limits A™" and f^!^HM infinity (or any other 
value specified by the experiment): 

oo oo 

•= / / , . — JAf JfpwHM (77) 

dxdy J J dxdydtFmmdAp 

with 



-II 



dxdydtYiwimd^F J J dxdydtYimmd^FdMdM. 

To be able to roughly compare the event rate predictions of Han & Gould (1996a), who used the event threshold criterion, we can assume f^^^HM ^<^y<^ C ^ 1 
and obtain Q ^ 12" "tt-o ^ q^^^^^ ~ O.egevent. 
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TABLE 1 

Observational setups for the WeCAPP survey and a potential experiment using the ACS on board of HST 



Parameter 



WeCAPP 



HST with ACS WFC 



fexp (s) 

'pixel (arcsec) 

Field of view (pixels) 

Field of view (arcmin) 

Filter 
ZP (mag) 
Average sky 

FWHM of the psf O^a (arcsec) 

Qpsf (arcsec^) 

AM 

Atmospheric extinction kr 
At (days) 
& (days) 

'fwhm (days) 

inner saturation radius (arcsec) 
CCD orientation angle (deg) 



500 
0.5 

2048 X 2048 
17.2 X 17.2 
Johnson R 



23.68 
20.0 
1.5 
5.1 
1.0 
0.1 
200 



200 
20 
45 



4096 X 4096 
3.37 X 3.37 

F625W 

25.73 (De Marchi et al. (2004), Table 3) 
22.5 

0.12(Krist (2003), p. 13) 



1000 
0.049 



0.033 


30 



20 





The thresholds fFWHM(-^i>') A™"(fFWHM,JC,>') (see eqs. [68] and [76]) are set by the experiment and the detection process.^"* 
For the WeCAPP experiment (see Table 1) toward M31 it turned out that the efficiency can easily be evaluated using Monte-Carlo 
simulations. As in the WeCAPP experiment errors are propagated through all reduction steps (Riffeser et al. 2001); the final errors in 
the light curve <j(x,y, f,) include the full reduction procedure. For a simple set of detection limits, i.e., A™" ^ a and t^^^f^ = const, the 
efficiency e(x,y, fFWHM, A/r) for a survey can easily be evaluated as a function of the directly observable parameters x, y, fpwHM, and 
Ap (in contrast to the variables Ie and Aq). This and more sophisticated thresholds (as used in Alcock et al. (2001b)) and efficiency 
simulations for WeCAPP we will present in a forthcoming paper. 
Using this efficiency we can generalize equation (77) to 



As the total event rate depends on the model parameters of the luminous and dark component, precise measurements of the event 
numbers and event rate's spatial variation can in principle constrain the source and lens densities [p/(x,y,Z)oi)> nsix,y,Dos)], the lens 
mass functions [^(M)], the distribution of the transversal velocities [pv,ivt,voix,y,Doi,Dos))], the luminosity function of the sources 
i^si-M.) or Pcmdsi-M,C)], and finally the MACHO fraction in the halo. There are, of course other valuable parameters, like event 
duration, flux excess distribution, color of the lensed stars, and finite source effects, which make the lensing analysis much more 
powerful than the pure counting of events. 

Table 7.3 summarizes the event rate predictions for the WeCAPP experiment toward the bulge of M31, using different realistic 
thresholds'^ for the signal-to-noise threshold necessary to derive "secure" events, and for f^HM- These numbers do not take into 
account that events caimot be observed when M3 1 is not visible (one-third of the year), that in the remaining time some - in particular 
short-term events - escape detections because of observing gaps, and that some of the area is not accessible for identification of 
lensing events due to intrinsically variable objects. We calculated the predictions for signal-to-noise thresholds of Q = 10 and 2 = 6; 
these thresholds correspond to flux excess thresholds of 6.2 x 10"'' Jy (Q=10) and 3.7 x lO'^'ly (Q=6) in the edges and 2.4 x lO'^Jy 
(Q=10) and 1.5 x 10"^ Jy (Q=6) 20" off center (outside saturation) of the WeCAPP field.''' The g > 10 events are events like those 
published in the past (e.g., WeCAPP-GLl and WeCAPP-GL2 have values of <2 ~ 85 and Q » 16), whereas Q = 6 should be more 
similar to the medium bright event candidates of MEGA. For the 2 = 10 cases we have separated events that do not show finite source 
effects in the light curves ("without fs") from those which show finite source effects ("with fs"). Finite source events are relatively 
more important for high signal-to-noise, short timescale self-lensing events. In most current pixel-lensing surveys, light curves with 
finite source effects are not speciaUy searched for and may preferentiaUy get lost in the detection process, unless one allows for a less 
good fit for bright events. 

For the 2 = 6, fpwHM = 2 days case we split the predictions into the near and far side of M31. Within our field, the predicted halo-bulge 
asymmetry is small, but the bulge-disk and halo-disk asymmetry are on a noticeable level. (Note that the disk-bulge lensing does show 
the reversed asymmetry). It has been pointed out in the past (An et al. (2004)) that dust lanes in the M31 disk are an additional source 
of asymmetry; this is obvious if one considers the spatial distributions of variables found in pixel-lensing experiments (see An et al. 
(2004), Ansari et al. (2004) and Fliri et al. (2006)). These can, however, be taken to quantitatively account for extinction, in addition 
to extinction maps. The values given in our table do not account for the small spatial dependence of extinction and thus place lower 
limits to the observed far-near asymmetry of the individual lens-source configuration. 

The comparison for different timescale thresholds (cases 111, IV, and V) shows that (except high mass halo lensing) the majority of 
events has timescales smaller than 10 days. A clustering of event candidates with short and long timescales as de Jong et al. (2004) 
observed for the MEGA analysis of the POINT AGAPE survey (they obtained 6 candidates with timescales smaller than 10 days and 
8 candidates with timescales larger than 20 d) can be hardly explained for the WeCAPP field. This is because, even for supermassive 

^ For completeness we can also introduce the color thresholds C™" and C""^, which may also depend on the experiment and use the distribution derived in eq. (65). 
2' This is equivalent to an efficiency c(.r.y, Af .fpwHM) = d{fyvjHM -f"wHM)^(^'^~^f '"('*->>')) experiment. 
A value of 10"^ Jy correspond to an "excess magnitude" of 21.2mag in the R band 
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TABLE 2 

Total EVENT rate r,j [y"'] for the WeCAPP experiment for self-lensing and halo-lensing 



Model 




T 
1 






TT 
11 








III 








IV 






V 




















Near Side 




Far Side 














b-b 


1.2 


+ 


1.9 


0.57 


+ 


0.68 


1.4 


+ 


0.98 


1.4 


+ 


0.99 


0.16 


+ 


0.046 


0.026 


+ 


0.0062 


hO. 1-b 


O.Z 


+ 


J A 


A 

4 


+ 


1 "7 
1. / 


O.J 


+ 


1.6 


7.1 


+ 


1.9 


u.yz 


4- 


U.U / 1 


U.ID 


+ 


u.uuy4 


nU.j-D 


7.4 


+ 


2.7 


4.4 


+ 


1 


5.5 


+ 


0.72 


6.3 


+ 


0.82 


1.8 


+ 


0.05 1 


0.4 / 


+ 


0.0074 


hlOOO-b 


0.7 


+ 


0.0013 


0.61 


+ 


0.001 1 


U.J i 


+ 


0.00051 


0.59 


+ 


0.00056 


0.75 




O.UUOzj 


A A 

U.6 


+ 


5 X 10~^ 


d-b 


57 




34 


26 




098 


89 


+ 


0.16 


0.087 


+ 


0.026 


057 




0029 


0072 








3.9 


+ 


0.0046 


1.9 


+ 


0.0019 


2.7 


+ 


0.001 


2.7 


+ 


0.001 


0.47 


+ 


4 X 10-5 


0.088 


+ 


5 X 10^ 


hMwO-5-b 


2.4 


+ 


0.0009 


1.5 


+ 


0.0002 


1.8 


+ 


0.0001 


1.8 


+ 


0.0001 


0.76 


+ 


2x 10^ 


0.23 


+ 


3 X 10-' 


b-d 


2.3 


+ 


1.6 


1.4 


+ 


0.77 


0.43 


+ 


0.22 


3.8 


+ 


1.3 


0.43 


+ 


0.049 


0.082 


+ 


0.0055 


hO.l-d 


11 


+ 


4.3 


6.6 


+ 


2 


4.9 


+ 


0.72 


12 


+ 


2 


2.1 


+ 


0.1 


0.5 


+ 


0.014 


h0.5-d 


8.6 


+ 


1.2 


5.8 


+ 


0.65 


3.6 


+ 


0.21 


9.5 


+ 


0.55 


3.1 


+ 


0.069 


1 


+ 


0.011 


hlOOO-d 


0.61 


+ 


0.00015 


0.56 


+ 


0.00014 


0.26 


+ 


5 X 10""^ 


0.7 


+ 


9 X 10-5 


0.7 


+ 


8 X 10-5 


0.55 




2 X 10-5 


d-d 


0.2 


+ 


0.13 


0.14 


+ 


0.075 


0.2 


+ 


0.056 


0.19 


+ 


0.056 


0.095 


+ 


0.018 


0.036 


+ 


0.0055 


hjvfwO.l-d 


3.1 


+ 


0.0019 


2 


+ 


0.00049 


2.2 


+ 


0.00025 


2.2 


+ 


0.00024 


0.95 


+ 


6x 10^ 


0.3 


+ 


7 X 10-' 


hjviwO-S-d 


1.9 


+ 


0.00022 


1.4 


+ 


0.00013 


1.4 


+ 


7 X 10-5 


1.4 


+ 


7 X 10-5 


0.97 


+ 


4 X 10-'' 


0.43 


+ 


5 X 10-' 



Note. — (d) disk; (b) bulge; (hO. 1) halo consisting of 0.1 Mq lenses; (hO.5) halo consisting of 0.5 Mq lenses (hmw tor MW-halo); (hlOOO) halo consisting of 1000 
Mq lenses. The numbers give the event rate in events per year integrated over the observed field and applying the following peak-flux signal-to-noise thresholds: (I) 
2=10 and t^^^^^ = 1 day; (II) Q = 10 and t^^^y^ = 2days; (III) g = 6 and f™J'jj^ = 2days for flie near and far side; (IV) Q = 6 and = lOdays; (V) g = 6 and 

'fwhm ~ 20days. These (S/N)-limits (at the light curve peak) are more reahstic than a flux threshold AJ?'", which is constant over the observed field, since the central 
region shows a strong gradient in the surface brightness and photon noise values. We have also separated events that do not show finite source effects in the light curves 
from those with finite source effects by a plus sign. Note that light curves with finite source effect signatures might be missed when using event filters with a classical 
lensing event shape in a stringent way. For the (Q = 6, ?fwhm = 2) case we split the predictions in those for the near and far side of the M31 (disk and bulge). Within 
our field, the predicted halo-bulge asymmetry is small, but the bulge-disk and halo-disk asymmetries are on a noticeable level. The comparison for different timescale 
thresholds (columns labeled "HI", "IV", and "V") shows that (except for high-mass halo lensing) the majority of events has timescales smaller than lOdays. A peak of 
events with timescales of 20 days or larger can only be understood with supermassive MACHOs or miss-identifications of variable objects. In the last line we add the 
analogous numbers for halo lensing resulting from Milky Way halo lenses of 0. 1 Mq . The MACHO events caused by the MW MACHOs should be roughly one-third 
of that caused by M31 MACHOs. 
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Fig. 16. — Event rate maps in logarithmic units (in units of events yr larcmin ^) for the WeCAPP survey for Q = 10 and fp^j^i^i = Id (column labeled "I" in 
Table 7.3). Coordinates are given in the intrinsic M31 coordinate system (see Figure 4). The contour levels are shown in inserts in the upper right corners of each 
diagram. The dashed line marks the 0.01 evy-' arcmin-^ level in each diagram. The event rate decreases near the center of M31 due to higher noise combined with 
finite source effects. Note that the maximum lensing (maximum optical depth) region defined by Crotts (1992) and Tomaney & Crotts (1996) is predicted at about 
l.Skpc (7.5') from the nucleus for a simple halo model. For the calculations we have taken into account the finite stellar source sizes; the numbers shown, however, 
include only those among all events that do not show finite source signatures in their Ught curves, i.e., those which are usually searched for in lensing experiments. 



MACHOs, one would expect roughly as many events between 2 and 20 days than above 20 days (compare case III and case V in 
Table 7.3). de Jong et al. (2004) argue that their long-term events arise in the outskirts of M31, where the photon noise is smaller, 
and could be understood from selection effects. This would still lack to explain the bimodality of timescales. At the moment it is not 
excluded that these long-term event candidates are still misidentified variable objects. In the last line we add the analogous numbers 
for halo lensing resulting from Milky Way halo lenses of 0.1 Mq. The number of MACHO events caused by the MW MACHOs 

See de Jong et al. (2006) for recent results. 
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Fig. 17. — Contours of the logaiithm of the lensing event rate per year within the 17.2' X 17.2' WeCAPP field as a function of the signal-to-noise threshold Q for 
the peak-flux excess and as a function of the timescale threshold of the events. For these numbers, eq. (77) was integrated over the WeCAPP field, and the noise was 
estimated for the WeCAPP survey (characterized by Table 1). We show results for different lens-source configurations, from left to right: bulge-bulge self-lensing, 
halo-bulge lensing with 0.1 Mq lenses, halo-bulge lensing with 0.5 Mq lenses, and disk-bulge self-lensing. The blue dashed line marks the 1 event per year level. For 
the calculations we have taken into account the finite stellar source sizes; the numbers shown, however, include only those among all events that do not show finite 
source signatures in their light curves, i.e., those which are usually searched for in lensing experiments. For signal-to-noise ratios of Q> 10, the rates for events with 
finite source effects can be of the same order as the rates for events with point-source light curves. For lower signal-to-noise ratios, events with finite source effects 
become much less important. 

should be roughly one-third of that caused by M31 MACHOs. 

Figure 16 shows the predictions for the spatial distribution of the lensing events for the WeCAPP survey, evaluated for the 2=10 and 
fFWHM = 1 day thresholds (column labeled "I" in Table 7.3). One can see that the event rate density becomes maximal close to the M3 1 
center for bulge-bulge and halo-bulge lensing configurations. That seems counterintuitive to the results about the lensing optical depth 
and single-star event rate in Figures 4 and 5, where the maximum is attained on the M31 far side, significantly offset from the center. 
This difference is due to the density of source stars, which rises toward the center much more than the single-star event rate and the 
detectability of the events drops. As can also be seen in Table 7.3, a far to near side asymmetry {lower and upper part in the figure) 
is not present for bulge-bulge lensing, is modest for halo-bulge lensing, and is stronger for the disk-bulge lensing. This is because the 
disk effectively cuts the bulge in one part in front and the other behind the disk, and only the stars in the second part can contribute 
to disk-bulge lensing. The bulge-disk self-lensing shows the opposite far to near side asymmetry and attains its maximum event rate 
per area in the far side of the disk. The same is true for the halo-disk lensing (main maximum on far side of disk), which shows a 
secondary maximum close to the M3 1 center caused by the increase of the disk-star density. The disk-disk lensing event rate per area 
is symmetric with respect to the near and far side of the disk. The fact that the maximum for bulge-bulge and disk-disk lensing is not 
located exactly at the M31 center is caused by the increased photon noise combined with finite-source effects. 

The total self-lensing (disk-bulge + bulge-disk + bulge-bulge + disk-disk) shows an asymmetry arising from the different luminosity 
functions and mass functions of the bulge and disk population which leads to different event characteristics for disk-bulge and bulge- 
disk lensing. Therefore, lens and source populations cannot easily be exchanged. The fact that the near side is closer to us - lensing 
strength and apparent magnitude of sources change by a few percent - than the far side of M3 1 plays a minor role for the asymmetry 
of self-lensing event rates. 

The last figure (Fig. 17) in this section shows the total event rates F/ j(f^jjj^,g) = / / ^^^^dxdy in the WeCAPP field depending on 
the peak flux threshold and the timescale threshold of the survey. We have taken into account the finite source sizes but show only the 
rate for those events that do not show any finite source signature in their light curves. 

For high signal-to-noise events (e.g., Q = 20), all configurations do show more or less flat contours in the fFWHM direction for fFWHM 
values between 0.1 and 1 days. This indicates that there are relatively few very high signal-to-noise events with timescales around 
0.1 days compared to events with timescales of about 1 day. That this is true is confirmed by Figures 11 and 12, which show that 
the highest signal-to-noise and thus highest flux excess events occur with timescales between 0.6 and a few days and that events with 
timescales of about 0.1 days are significantly fainter Only for smaller flux excesses, the events with timescales of 0.1 days can be as 
common as events with timescales of a couple of days. This implies that if one can measure only lensing events that have a (S/N) 
ratio of Q> 10 in the WeCAPP setup, one cannot greatly increase the number of observed events by increasing the sampling (the 
largest increase for sampling below 1 day would occur for halo-disk lensing). The detectability of events with timescales of hours can 
therefore be increased effectively only (in the central M3 1 field), if the noise level of the observations is lowered. It can also be seen in 
all panels of Figure 17 that one expects the number of events to decrease strongly for fpwHM-thresholds larger than several days. The 
quantitative differences in the different lensing configurations in the subpanels of Figure 17 can be easily understood (by combining 
the dependence of the event rate on event timescale and magnification with the luminosity function of source stars, and accounting 
for the difference in the importance of finite source effects) and are discussed for the 0. 1 Mq MACHO bulge and the 0.5 Mq MACHO 
bulge lensing case. Table 7.3 already suggests that these two lensing configurations are very similar for the event numbers that do not 
show finite source signatures (the O.IM© MACHOs do cause more high signal-to-noise finite source events). One expects that the 
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TABLE 3 

Total EVENT RATES r^j [y"'] for a 30 day, l-ORBiT-PERDAY/JSrACS EXPERIMENT (SEE Table 1) 



Model 


Without f s 


Withfs 


b-b 


1350 


100 


hO.l -b 


620 


10 



Note. — The rates are shown for different lens-source configurations: (b) bulge; (hO.l) - halo consisting of 0.1 Mq lenses. The numbers give the rate in events per 
year integrated over the whole ACS field and applying Q = 6, = 1 day and '"^jj-^j^ = 20 days thresholds. "Without fs" describes events, that show no finite source 

signatures in the light curve; "with fs" gives the number of light curves with finite source signatures. 

increase of the MACHO mass decreases the total event rate per Une-of-sight (fewer lenses) but also increases the events' timescales. 
On the basis of the event rate per timescale and event magnification (Figure 8) one would therefore expect longer timescales but fewer 
events for the stellar mass MACHO case. This simple picture is altered by the finite source effects, which limit the maximal flux 
excess and thus signal-to-noise of an event. The question, whether one expects more events for brown dwarf or stellar mass MACHOs 
depends therefore on the combination of signal-to-noise threshold and timescale threshold. 

The predicted event rates rise strongly for lower Q thresholds. This means that if one lowers the (S/N) threshold - or equivalently 
increases the signal or decreases the noise level by changing the experiment - one could dramatically increase the event rates. This can 

be achieved by an increase of telescope area and integration time, but much better with a decrease of the PSF of the experiment. This 
makes the space experiments most promising. In addition, the comparison of the subpanels in Figure 17 shows that the bulge-bulge 
lensing rate will profit much stronger from a decrease of the noise than any other lensing configuration. 

Assume that the center of M3 1 is monitored with the ACS on 30 consecutive days, with three 6 minute exposures in the F625W band 
and two 6 minute exposures in the F555W band, which would need one orbit per day altogether (see more details in Table 1). Assume 
furthermore that the background light has the level of the smoothed M3 1 SFB isophotes (in reality a fraction of the brightest stars gets 
resolved lowering the background light in between the resolved objects). Using that background level, we predict the event rates with 
timescales between 1 and 20 days for bulge-bulge self-lensing and halo-bulge lensing in Table 7.3. If one assumes a halo fraction of 
about 25% then the halo-lensing events do not contribute more than 10% relative to the bulge-bulge lensing rate. 
Current measurements of extragalactic mass functions reach masses down to 0.6 solar masses (for the LMC Gouliermis et al. (2005)). 
Microlensing allows a measurement of the low-mass end of the stellar mass function, while not relying on the luminosity of those low- 
mass stars. The mass of these stars becomes visible by their lensing effect on (in general) brighter stars. Therefore, M3 1 bulge-lensing 
combined with space observations makes it possible to test an extragalactic mass function well below 0.5 Mq. 

7.4. The Luminosity Function Sensitivity 

Whereas the probability for a star to be lensed does not depend on its luminosity, the probability that the event can be detected strongly 
depends on the luminosity of the source star. This imphes that the luminosity function of the source stars of lensing events is biased 
toward high-luminosity stars. The selection probabiUty of a star with luminosity A4, which we call "luminosity function sensitivity", 
is obtained with equation (63) as 

CXD oo oo oo Z)ys 

■TT77::ArTU= I I f ^' f"-^^ f PiRe pJ^^^vAdDoxdD^sdMdApdtvmm, (79) 

^s{M)axdydM J J fpwHM^ J J Fo J V^fwhm / 

'™„MAr " 

wifli file parameters and relations used for <^s(M), £,i(M), n,(x,y,Z)os), Pi(x,y,Doi), Re(DouM,Dos), Fo(M,Dos), *(Ao(Fo, Af)), 
T(Ao(Fo, Af )). The luminosity function sensitivity gives the event rate per area per source star luminosity bin, normalized by the 
luminosity function $j(A4). In Figure 18 we show results for the luminosity function sensitivity using equation (79) for several mini- 
mal detectable timescales f^HM > 01, 0.1, 1, and 10 days) using the configuration of the WeCAPP and ACS experiments and the 
model of M3 1 presented in § B. 

Figure 18 shows that the sensitivity strongly increases with decreasing timescale thresholds. Applying no ?fwhm threshold for the total 
event rate (equivalent to evaluating equation (71)) overestimates the luminosity sensitivity for faint MS stars. In consequence, the total 
event rate is overestimated as well. Accounting for the timescale thresholds for microlensing surveys (i.e., using equation (77) for the 
total event rate) suppresses the contribution of faint stars and yields a much more realistic estimate of the total event rate. In Figure 18 
(right) we show the luminosity function of the sources for lensing events, which is obtained as the product of the luminosity function 
sensitivity and the luminosity function ^s(M)- The results differ for an experiment like WeCAPP and an experiment with small PSF 
noise like the suggested ACS imaging campaign. 

An experiment like WeCAPP induces a cutoff of A4r^ 6 mag in the luminosity of the lensed stars, because one would need magni- 
fications larger than the finite source size magnification limit to obtain an observable flux excess for source luminosities below that 
value. This cutoff is valid for all lens-source configurations within M31, with the exception of supermassive M31 MACHOs, and it 
does not hold for lensing by MW MACHOs. With an ACS experiment, the minimum measurable flux excess is much smaller than for 
WeCAPP, and therefore even the faintest MS stars can act as sources for detectable lensing events (no cutoff in the luminosity function 
of lensed stars). For events with timescales above 1 day, the luminosity function of lensed stars becomes almost flat for magnitudes 
brighter than Mr^ 4 mag (green curve). 
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Left panel: Luminosity function sensitivity (eq. [79]) [arcmin ^y '] 



Fig. 18. 

and ACS experiment (see § B) witli Q 



for bulge-bulge. O.IM© halo-bulge, 1000 halo-bulge lensing in the WeCAPP 
12 at (x,y) = (l,0)kpc (corre.sponding A™" = 1.7 X 10"^ Jy for WeCAPP and A™" = 3 X lO-'jy for ACS) Black line: no 
'fwhm threshold; magenta line: t™^^^y^ = lOdays; green line: fp^jjjyi = 1 day; blue line: t™^^^^ = 0.1 day; red line: t™^^^^ = 0.01 day. The curves for the dark colors 
have been obtained with the point-source approximation, the curves with the light colors account for the extended source sizes, which further suppresses the luminosity 
sensitivity (using D*^ instead of Dos as integration limit in eq. [79]) . Stars fainter than = 5.5 mag cannot be lensed at all to an event with a signal-to-noise ratio 
Q larger than 10. Right panel: Luminosity function of the sources for lensing events d?Ts^i / (dxdydM) [arcmin^^ y"' mag"' ]. For f^y^y^ thresholds of 1 day that are 

1-6 



typical for current experiments the probability to have an MS star among the lensed stars is very low ( < 3 X 10 
halo-bulge lensing, < 0.0002 for 0.5Mq halo-bulge lensing, < 0.2 for IOOOMq halo-bulge lensing). 



for bulge-bulge lensing, < 3 X 10 for 0.1 
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8. THE LENS MASS PROBABILITY DISTRIBUTION FOR INDIVIDUAL LENSING EVENTS 

The lens mass probability distribution is one central goal of the analysis of lensing events. We show how this function is extracted 
from the individual events, depending on whether ?e and Aq (microlensing), or ?fwhm and (pixel-lensing) can be measured. 

8.1. The Lens Mass Probability Distribution, Obtained from the Observable t^^^ 

For completeness we recall in this section the classical microlensing formalism, where the intrinsic source flux Fq is supposed to be 
known, to derive the mass probability distribution from direct measurements of t^^ and M™^^ or equivalently "'^^ and A™ 
Starting from the integrand in equation (51), without carrying out the mass integral and averaging over all source distances yields 

(jsS^), J'-^^'H^) 



which can also be converted to 

dMdt.dA, A = j P^^A^e^^""^ J P^^'^^^P- U' " ) ^^^^o'^^- ('^^ 



In the right-hand side of equation (8 1) the maximum magnification Aq appears only as a pre- factor (see eq. [8]), and the right-hand side 
of equation (80) is independent of the relative impact parameter mo- This implies that the magnification at maximum or equivalently 
the relative impact parameter does not enter the mass probability function of the lenses. This is expected, since these quantities depend 
on the lens-source trajectory but do not contain any information about the lens (unless the impact parameter b could measured by other 
means). The lens mass probabiUty function thus solely depends on the measured Einstein timescale, and becomes 



^ nfm^\ 5(,^ f^ea.^^,^ / ^^r(M,c") \ ^ (82) 
^meas J \dMdtEduo / ^ ^ ^ ^ \ dM dtuduf, // 



if fE can be measured without any error. 

The measurement error of /e can be accounted for by replacing the (5-function with the probability for the Einstein time for a 
measured value of t^^^^. For a Gaussian probability with width one obtains 

PiM;tr\^,r^)- j (0|S)/te;r-,<x.rO^.B, (83) 

with 

g{t;t^'^,a,^) := exp - \ ^ ^ 

Our result in equation (82) is proportional to the result of Jetzer & Masso (1994)^^ (see also Jetzer 1994) but differs from the result of 
Dominik (1998). The lens mass probability function has also been calculated by de Rujula et al. (1991) for events with measured 
maximum magnification Aq"^'"* and Einstein timescale f™"**. We could not match the result published by them with ours once we 
converted their notation to ours. 

28 Converting Jetzer&Mass6(1994) toournotation withP(/i,r)= ^^^^^^,r = fE,/i(xM,x = Z)oi/0^^ oc ^(M), r|/«:(l-A:) ocRe^, 

VH = gives 



p{Doi) exp 



$(M) clMdtE Ie'^ J V Dos J V 2a^tE^ 



2' Converting Dominik's (1998) eq. (21) to our notation with ixsM/Mq,(jJq = ut,x = Doi/Dos, PoH{x) s p(Doi), Vc s \/2(j\, C, = vt/(V2a\), K(Q s py^(vt)V2a\, 

= = (.m, roV^il^) = Re/^M/Mq gives 



dV 
dM 



Converting de Rujula et al. (1991, eq. (25)) to our notation with tx M, t' txtE, t tx t^^, wj^^ s mq, mth = "T, A' = >lo(«o), ^^max = A^^, Ath = M{ut), 
;s 

1 dV{M) 



x=Tr' Po^'W = p(Ooi) gives 



.meas jmeas 



5(M) dM 

This equation disagrees in some powers in Re and tE to our result. 



j^^ dua j j j 5(tE-t^)dtES{A^'^-Ao(uo))RE(Doi)p(Doi)dDoip,,(vt)dvt. 
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8.2. The Lens Mass Probability Distribution, Obtained from the Observable and m™''* 

Now we discuss the case in which the FWHM ?fwhm and the relative impact parameter uq are the available observables. Starting from 
the integrand in equation (59), without carrying out the mass integral and averaging over all source distances yields 

( -lUT. T- ) = 3 / p(£»oi)Pv. w(mo) RE^dDoidDos. (84) 

\aMarFWHM«Mo/ J J tfwhm J V'fwhm / 



The mass probability functions piM;t^^^,UQ^^) and piM',t^^,at^^ jM^^'^^aumeas) can then be obtained analogously to equa- 
tions 82 and 83. Of course, equations (80), (81) and (84) are equivalent anS can be converted into each other as long as ?fwhm and mq, 
and thus are known. 

Equation (84) nicely illustrates the transition to the pixel-Iensing regime: As soon as Je is not an observable anymore (but only 
?fwhm), the relative impact parameter enters in the integral in equation (84), and the mass probabihty function becomes dependent on 
the maximum magnification of the source. In pixel-lensing one is often in the situation where the ?fwhm is known quite accurately and 
Mo(Ao) is known to certain Umits (if finite source effects and/or space observations can rule out certain magnifications and constrain 
the magnification interval, equation (84) leads to more realistic results than equation (80)). In this case, the mass probability function 
can roughly be obtained with /5(M;f^'j5|y[,(T(™s^,M™^%c7„ra™s) including the errors of f^HM and M™'^''^ But it is not appropriate in 
this case to convert f|wHM to fE^'" (using m™"") and then to obtain p(M;?™''%cr,™»0 from equation (82), since then the error for 
derived from a measured t^^j^ also depends on u^^. 

8.3. The Lens Mass Probability Distribution, Obtained from the Observables t^^^, Af''^ and C"^^ 

Finally, we discuss the situation most relevant for pixel-lensing, i.e., the case where only the flux excess , the FWHM timescale 
^FWHM, and the color of the event are determined accurately from the Ught curve with coordinates x™^"* and y^'^^. 
With the use of equation (65) one obtains 

n /'fl/f. ^nieas Ameas /^meas\ i'H/f *nieas Ameas /^meas\ 

OO CO Dos (85) 

= 6 z^-^T / Pcmds JnsK I Pi Re Pv, dDol dDos dM , 

VFrniM/ 



with 



i-i,s(.M,tTmm,^F,t) .- / , , ■ , . ./^ . A >< 

J dMaxaydtp^maApdCdAd 



ignoring the (tiny) errors for the location of an event. The functional dependence of the arguments in the integrand are ^i(M), 
/?cmd,(X,C™-), n.(x™'=^^y"«=^^£»os), *(Ao), Ao(Fo, A?!«^^), Fo(Al,£»os,^), pK^''Sr''%Doi), /?e(Do1,M,Dos), Pv.(j^,vo), T(Ao), 

\ 'fwhm / 

and vo(x™^^%y™^%£)oi,£>os). 

Again, as outUned in equation (83), one can include the errors of the observables with a Gaussian measurement probabihty: 

Pm(^;^to™,0'?J^, A^^''^crA™,C'"^^^crcm»0 ///r/,,(M,rFWHM, Af ,C)g(rFWHM;?^^,crt5^) 

xgiAp;A^'^\aAT)8iC;C'^^''',ac~m)dtpmmdAFdC. ^ ^ 

If the light curve colors C™""^ can be measured very precisely, the calculations can be simplified using a luminosity distribution taken 
from the color-magnitude-diagram for a certain population. Mathematically this can be written as 

l>'^(A^)0C J PcradiM,C)d{C-C'^'''')dC. (87) 

The modified luminosity distribution ^^{A4) replaces Pcmdj in equation (85). 

9. CONCLUSIONS AND OUTLOOK 

Gravitational microlensing is a powerful method to detect compact luminous and dark matter objects in the foreground of stars in 
nearby galaxies. It can thus be applied to measure the mass function of steUar populations and dark halo objects (MACHOs). 
One could infer the mass of an individual lensing object from the lensing light curve directly, if the luminosity of the source, the 
observer-lens-source distances, and velocities would be known. However, at least the lens distance and velocity are unfortunately 
almost never known. Hence, distribution functions for the lens and source quantities (see §§3 and 4) have to be used to finally obtain 
the mass-probability function for individual lensing events. 

We used these distribution functions to rederive weU-known relations like that for the optical depth, single-star event rate or mean 
Einstein time of the events. These quantities were taken in the past as "back-of-the-envelope" estimates of lensing frequencies 
to design microlensing surveys and were evaluated for line-of-sight distances to the plane of M31 only, i.e., simplifying the three 
dimensional structure of M3 1 . We also accounted for the distance distribution of the sources and obtained the line-of-sight distance- 
averaged quantities for the optical depth, single-star event rate, and Einstein time instead. We show their values as a function of 
line-of-sight positions with contour plots in § 5. The shape of the total optical depth contours (Fig. 4, third row, left) obtained in this 
way deviates from earher results (in a way that is understood by the simplifications made; Gyuk & Crotts (2000)). 
Furthermore, we derived the distribution of the microlensing events rate as a function of FWHM timescale and the magnification 
of the event. We evaluated this function for a position [i.e., (x,y) = (l',0')] within the WeCAPP field and find the following: the 
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values of timescale and magnification are largely confined to a linear region within tiie time magnification-FWHM timescale plane 
(§ 5.5, Fig. 8); an observing frequency of once per day is sufficient to identify the majority of events with magnification of the order 
30-100; and higher magnification events will have smaller timescales on average. Progress in the number of detected lensing events 
can made by lowering the magnification threshold for the event detection or, less efficiently, by further improving the time sampling. 
The lowering of the noise per PSF can be best achieved by small PSF and pixel sizes, i.e., by space observations. 
We then discussed the pixel-lensing or difference imaging regime, which is the situation where the majority of stars is hardly or not 
at all resolvable anymore. One then has to include the source luminosity function to account for the additional unknown variable, the 
intrinsic source flux. With that, we derived the distribution of the lensing events (at a fixed position in the central M3 1 field) as a 
function of the two main observables in the pixel-lensing regime, the excess flux and the FWHM timescale. The values of these two 
quantities are not as confined as those in the magnification-FWHM timescale plane anymore. Due to the broad luminosity function 
there exists a variety of combinations of magnification and intrinsic source flux which yields the value for the flux excess. Events with 
high flux excess are dominated by PMS source stars. 

It had been noticed before (Gould 1994b; Auriere et al. 2001) that measuring or excluding finite source effects is useful to tighten 
constraints on the masses of lensing objects. But finite source effects also change the number and characteristics of events: In the 
presence of finite source effects, the event timescales are increased and the maximum magnification saturates below the maximum for 
the point-source approximation (Fig. 1). This shifts events to longer timescales, but also suppresses the number of high-magnification 
events, and therefore the number of observable events. Since events that are ultra-short (of order 0.001 days) in the point-source 
approximation are mostly high-magnification events (Figure 8), they all do show finite source effects (if the lenses are residing in 
M31) and thus have larger timescales than 0.001 days if the source sizes are taken into account. This explains the absence of ultra- 
short events for configurations with lenses in M3 1 (see Figs. 1 1 and 12). 

Using equation (65) and a flux excess threshold, one can predict the time scale distribution of the events in Figures 11 and 12. At 
different locations within M3 1 the amplitudes of the contours change, and some details of the contours can be changed and moved in 
the flux excess-FWHM timescale plane. However, in any case shown here one expects many more short-term events with timescales 
of 1 to several days, than long-term lensing events with timescales of 20 days or longer Even supermassive MACHOs with 1000 
have about roughly the same number of events within 1 and 20 days as above 20 days. A bimodal distribution of event timescales, 
with most events between 1 to 5 days, none between 10 and 20 days, and a second group of events with timescales 20 d and above is 
difficult to understand (compare event candidates of de Jong et al. (2004)) on that basis. De Jong et al. argue that their result (many 
long-term events, and the correlation of the event duration with the distance to the M3 1 center) can be understood, since the noise 
level is lower in the outskirts, which would allow the detection of the long timescale events. This does not explain the bimodality in 
the event timescales (see Figs. 1 1 and 12). 

Most searches for microlensing were started based on fairly simple calculations of the expected event rates (see Han (1996)). Their 
event-threshold criterion can be translated to a peak-threshold criterion (see § 7.2). This yields about 200 events per year with a 
minimum signal-to-noise of 2 « 6 at maximum flux, and 15 events with Q ~ 50 (for their model survey, assuming 100% efficiency). 
The event rates measured up to now in M3 1 pixel-lensing surveys are below the expectation values for pure self-lensing (using simple 
estimates of survey efficiencies), while for microlensing surveys toward the LMC and the Galactic bulge the numbers of detected 
self-lensing events satisfy the predictions. The apparent lack of M31 events can be due to an overestimated detection efficiency or 
previously overestimated lensing rates. We used the event distributions as function of flux excess and fwhm timescale, and the light 
distribution of M3 1 to finally derive the number of halo-lensing and self-lensing events within the WeCAPP field that exceed a given 
signal-to-noise ratio at the light curve maximum and have timescales of 1 day or larger (see Table 7.3). 

For minimum signal-to-noise ratios of g = 10 and a minimum timescale of 1 day one expects about 4.3 (bulge-bulge, disk-bulge, and 
bulge-disk) self-lensing events per year that have light curves as for point like sources and about 4.0 with finite source signatures in 
their light curves. For timescales above 2 days these numbers decrease to about 2.4 for point-source and 1.6 finite source signature 
events per year. Since M31 cannot be observed more the two-thirds of a year, the total efficiency will be not larger than 50% 
(WeCAPP), even for a survey with good time coverage. This means that there are not much more than a couple of self-lensing events 
with Q= 10 and timescales larger than 2 days in a WeCAPP field per year. A decrease of the "acceptable" (S/N) ratio at maximum 
light to 2 = 6 does increase the number of point-source events (roughly) less than a factor of 5 and has little impact on the events 
with finite source signatures. In addition, at this variation level, a considerable fraction of the iirea is occupied by intrinsically variable 
objects, which makes the detection of lensing events even less effective. 

These values are much below the already mentioned previous estimates. The identification of the WeCAPP-GLl and WeCAPP-GL2 
event with a signal-to-noise ratio of 2 ~ 85 and 2 ~ 16 at peak flux and a FWHM timescale larger than 1 day in the WeCAPP 
2000/2001 data is in good agreement with our theoretical expectations. We will present WeCAPP results on lower signal-to-noise 
events in a forthcoming paper and compare these numbers with expectations in more detail. 

The most efficient way to increase the number of lensing events is to lower the noise level. We investigate the number of self-lensing 
events that can be obtained with a 30 day survey of the M3 1 center using the ACS on board HST (l orbit of total integration time per 
day). Since bulge-bulge self-lensing profits more from lowering the noise than the halo lensing (see Figure 1 1), a decrease of the noise 
level increases the self-lensing relative to the halo lensing. During this campaign we expect of order 120 bulge-bulge self-lensing 
events with a peak signal-to-noise ratio of 6 and timescales between 1 and 20 days. Halo lensing with Q.IMq lenses would cause an 
additional 50 events if the halo is composed of MACHOs by 100%. If the halo fraction is not more than 25%, then the halo-lensing 
events would drop to a 10% of the total lensing events. The analysis of the lensing events (frequency and timescale) would provide a 
measurement of the low-mass end of the mass function in the bulge of M3 1, i.e., the first measurement of the mass function of stars at 
low masses outside our Galaxy. 

Finally, we investigated the luminosity function of the stars that are lensed. The result is very sensitive to the timescale threshold of 
the survey. MS stars can only be seen if they are highly magnified, which implies (Fig. 8) extremely short event timescales. Present 
day surveys with minimum timescales of one day therefore do not see any main sequence stars (for self-lensing in the central bulge 
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Fig. 19. — Examples for the distributions of events within the time interval of the survey, for the standard definition and the maximum light curve definition. The 
magnification threshold is chosen as Aj = 2. The green light curves are events for both definitions, the blue ones only for the maximum light curve definition, and the 
red ones only for the standard definition. Of course, in both cases the times, when the events occur are homogeneously distributed within the survey time. The event 
time, however, is not equal to the time of light curve maximum in the standard definition case. It may happen, the light curve maxima is attained only after the survey 
has ended (red cun'es). 

field). One can turn that result around: if one could identify a modestly bright event with A/r > 10"^- 10"^ Jy with an MS source star 
and timescale of 1 day or larger within the WeCAPP field (e.g., with spectroscopy by an "instantaneous alert"), it would point to a 
MACHO. This MACHO would have to be very massive if it was within M3 1 and could be less massive within the Galaxy. 
Another interesting observable is the flux excess of the brightest events. One can infer from equation (18) and Figures 1 1 and 12 that 
the inclusion of the source sizes yields to an upper limit of the excess brightness of the events. The value depends on the flux-to-radius 
ratio of the brightest PMS stars in the lensed population and the mass of the lens, plus some source and lens distance factors. If the 
radius-luminosity relation of the source population and the luminosity of the brightest PMS stars are known, one can obtain for every 
event a lower lens mass limit for each source-lens configuration considered. 

We warmly thank Claudia Maraston for discussions concerning stellar populations in M3 1 . We also thank Joachim Puis for useful 
conversations about mathematical problems. We thank Christof Wiedemair and Aglae Keller for their help. We also thank the 
anonymous referee for the very useful discussion. Part of this work was supported by the Sonderforschungsbereich, SFB 375 of the 
Deutsche Forschungsgemeinschaft (DFG). 

APPENDIX 

A. STANDARD EVENT DEFINITION AND MAXIMUM LIGHT CURVE EVENT DEFINITION 

We motivate our alternative event definition in that section and illustrate the differences from the standard definition. The standard 
definition, in which a lens becomes an event if it exceeds a threshold in magnification Aj (equivalent to entering the microlensing tube 
with the corresponding radius b-Y for a given lens mass) has two consequences: 

1. Since only lenses that enter are counted, only the formal event times (when the magnification threshold is exceeded) but not 
the event maxima are homogeneously distributed within the survey time interval, if their event timescales are not much shorter 
than that, (see Figure 19, red and green curves; for the events with red light curves, the maximum will arise after the survey 
has ended) 

2. The microlensing tube changes with the magnification threshold, and so does the spatial distribution of lenses that cause events 
within Af (see Figure 21, red curves, for a special example with VjAf = 1). Lenses that cause events with a higher magnification 
threshold within the survey time are not all a subset of those with a lower magnification threshold. Taking this event definition 
literally would make Monte-Carlo simulations time-ineffective, since high-magnification threshold subsamples could not be 
picked out from a more general sample. 

For event searches in data one usually requires to measure the light curve around maximum (to check the light curve form, in particular 
its symmetry), and of course, in practice, one would not exclude a light curve from an event list, if it was above the magnification 
threshold at the beginning of the survey. 

This motivates the use of the maximum light curve definition, which only accepts events that obtain their maximum within the survey 
time interval Af . A threshold of the magnification At at light curve maximum, then, is equivalent to a maximum impact parameter bj 
of the lens (for a given mass). 

We now consider events with a threshold bj for both event definitions. We calculate the location of those lenses at survey begin that 
become events within the survey time interval. We assume the lenses to be distributed in a plane and to have velocities VtAf = 1. 
Length scales are given in arbitrary units and the density of lenses is assumed to equal n = 1 in these units. The number of lenses per 
radius interval is ^ = 271 r. This curve is shown in black in Figure 21. 

The blue and red curves show the number of those lenses per radius interval that become events within Af in the new and the standard 
definition, respectively. For the standard definition, only lenses with bj < r < bj + Vt^t will become events within Af, explaining the 
minimum and maximum radius in Figure 21 (red solid and red dashed curve, for a threshold of bj = 0.5 and bj = 0.8, respectively). 
In the maximum light curve definition lenses within < r < [bj^ + (vtAt)^]^^^ can cause events with b < bi, explaining the maximal 
radius in Figure 21 {blue solid and blue dashed curves, for the threshold of bj = 0.5 and bj = 0.8, respectively). The relation between 
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Fig. 20. — These two figures illustrate how the impact parameter distribution and the radial distribution of lenses that become events in the maximum light curve 
definition are related. In the left panel we choose as an example a coherent particle flow with velocities Vt in one direction. Those particles that have an impact 
parameter h <bj and will attain their minimum distance to the source within the survey time interval At are contained in the black box. Three areas have been shaded 
with blue, magenta, and green. The same colors have been used in the right panel to show in which part of the radial distribution function the events enter. 
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Fig . 21. — Number of lenses per radius interval (black) and number of lenses causing events per radius interval. Curves for the standard definition are in red, and for 
the maximum light curve definition are in blue. Length scales are in arbitrary units, the density is chosen such that it equals n = i in these units, and the velocities are 
chosen such that Vt A? = 1 holds. The thresholds for the microlensing tube radius and the maximum impact parameter have been chosen as hj = 0.5 {solid lines) and 
bj = 0.8 (dashed lines). The integral of the corresponding red and blue curves coincide and give the number of lenses that cause events within Af. For the standard 
definition (the locations of) the lenses that cause events with a higher magnification threshold are not all a subset of those with a lower magnification threshold. In 
addition, lenses that are already within the microlensing tube never will cause any event for the standard definition (implying the lower cutoff). The red and blue curves 
shown here are straightforwardly obtained analytically (derivation not shown in this paper). 

the features in the radial distribution of lenses becoming events and the particles motion is shown for the case of a coherent particle 
stream in Figure 20. 

One can also see in Figure 21 that lenses with the higher magnification threshold (corresponding to bj = 0.5) are a subset of those 
with the lower magnification threshold (b-Y = 0.8). In addition, the lenses causing an event within Af are spatially more confined than 
for the standard definition: the maximum radius from which a lens can cause an event with an impact parameter bj within Af is 
[bj^ + (vtAt)^]^^^ for a relative velocity of Vf. 

Hence, it is obvious that new event definition is more easy to use in simulations, but also more directly linked to observations. On the 
other hand, one can guess from Figure 21 that the integral of the corresponding blue and red curves in Figure 21 agrees. Therefore, 
the number of events for both definitions is the same. 

This implies that for both event definitions, a magnification threshold Aj or impact parameter threshold bj (for a given lens mass) 
yields the same events (same number of events and same light curve parameters for the events, with exception of the time of maxi- 
mum); only the lenses that cause the events are different in both definitions. Since it is not relevant where the lenses that cause events 
have come from, one can conveniently switch definitions. 
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B. MODELING OF M3 1 
B . 1 . Density Distribution 

This section contains our models for the bulge, disk and halo density of M31 and comparison with observations. We show that taking 
a bulge with the same total mass as Kent (1989b) and a disk with the same total mass as Kerins et al. (2001) impUes mass-to-light 
ratios for the stellar populations of bulge and disk in good agreement with expectations from population synthesis models. Our bulge 
model matches the observed surface brightness values of M3 1 better than previously published analytical models, which is important 
for the correct self-lensing prediction in the central part of M31. The contributions of the bulge and disk to the rotation curve are 
almost identical to that shown in Kerins et al. (2001), which allows us to assume the same density distribution for the dark halo as 
they did. 

In this section we use the disk major axis coordinate system ixo,yo,zo', see Fig. 4), which can easily be transformed to the Une-of-sight 
coordinate system using an incUnation i of 77° of M3 1 (Stanek & Gamavich 1998). 

B.1.1. Bulge of M31 

Our M3 1 bulge model starts from Table I of Kent (1989b), containing the Gunn-r surface brightness and ellipticity values pf™'(«) and 
e'^^"'(a) as a function of major-axis distance a to the center of M3I. We assumed 50° for the position angle of the bulge. Figure 22 
demonstrates that with 



I 



I-e(a) 



: 0.254- 



arcnun 



+ 1.11 



(Bl) 



the ellipticity e(a) (red curve) becomes an excellent approximation of ef^^d) (blue crosses) between 0.5' and 6'. 
With this relation we convert (xo,yo,Zo) to a by solving the quadratic equation =Xy+3'Q + (0.254a+ l.Il)zo. 



aixo,yo,zo) = 



0.254zl + -y0.2542z4 +4 (xl+yl+l.l l^) 



[arcmin], 



(B2) 



with >'(), Zo and a in arcminutes. 

The three-dimensionally decomposed spatial brightness density profile of the M3I bulge derived by Kent is well approximated by an 
a'/"* law (see Figure 22b). With equation (B2) the bulge mass density becomes 



Pbulge(-^0,>'0,Zo) := < 



PO 10-0-4(20.4a'/^-6.68)^ 



a < 0.014', 
0.014' <a< 0.09', 
a > 0.09', 



where 



Po- 



lo 



-0.4 [pl^'-(M,-Mx)-Ax-d,^-M@^] 



arcsec-" 



(B3) 



(B4) 



is the central mass density derived from the central brightness density in the r band, pf™^ = I5.I9magarcsec ^ (Kent (1989b), Table I), 
and (M/L)x is the bulge mass-to-light ratio in a fiducial filter X, and (Air- -Mx) is the color of the bulge population; A4q^x '■= 
—2.5\og(TQx/Fy^g,ax) is the absolute brightness of the Sun in that filter and dmoi is the distance modulus to M3I. 
Kent (1989b) fixes the bulge mass to 4 x lO'^M©, which for afniod = 24.19mag (690kpc) and without correcting for dust extinction 
implies a (M/L)r-ratio of 6.05 (using our analytic approximation for pbuige) and 5.5-6.6 (integrating the tabulated values of Kent 
and estimating the maximal uncertainties due to the coarseness of the table^'). Using the favored distance to M31 (t/mod = 770kpc) 
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Fig. 22. — (a) The gray curve shows our approximation for t{a) as defined in equation (Bl), and the red crosses are the tabulated values of Kent (1989b); within 
0.5' and 6' the agreement is excellent, (b) Bulge surface brightness as tabulated by Kent (1989b) is shown with crosses, and our approximation from equation (B3) as 
a gray curve. 



We derived the upper and lower Umit for the total brightness of the bulge in Kents Table I. By summing over ellipses (with an area A, := 7ra?(l-ei) at semi-major 

distance a,, A, =0, and with a suri'ace brightness l,-j := L0.,-l{r*"*''^'''''-''""i--^0 '\ we got L™",/^'©.'- = Y.]lil>-^i (^i - ^i-lX and LJJ^/L©,^ = Eji'^W (^''-^i-l)- 
These limits lead to a shghtly higher (M/L)r between 5.5-6.6 than the value given by Kent (1989b): {M/L)r = 5 ±0.5. 
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TABLE 4 

Mass-to-Light Conversion Ratio for the M3 1 Bulge Model 





Distance 














M,oi/Mq 


(kpc) 


BandZ 


(Mr-Mx) 


Ax 


itol.X /Lqx 


(MlL)x 


Comment 


4x 10'° 


690 


r 








6.61 X 10' 


6.05 


Kent's model using eq. (B4) 


4x 10'" 


770 


R 


0.59 


0.36 


13.5 X 10' 


2.96 





Note. — This table shows that a bulge mass of M = 4 X 10 Mq as proposed by Kent is a good estimate, even for the more realistic value (770kpc) for the M3 1 
distance. 

and applying reasonable extinction values, the dust corrected mass-to-light ratios reduce to lower values (for a constant bulge mass of 
4x 1O'°M0, see Table B. 1.1). 

The R band values were obtained with = 4.42mag, {Mr- Mr) = 0.43 + 0.15(B-y) = 0.59 (Moro & Munari 2000), and a bulge 
color of (B-y) « 1.05 (Walterbos & Kennicutt 1987). 

According to Han (1996) the effect of an asymmetric bulge light extinction caused by the highly inclined M31 disk is negligible. We 
therefore adopt his values for the mean internal extinctions toward the bulge in the V and / bands of Ay = 0.24mag and Ai = 0.14mag 
and interpolate to the R band which yields Ar = 0.19. With the foreground extinction of Ar = 0.17 (Schlegel et al. 1998) the total 
extinction becomes Ar = 0.36. In this case, the mass-to-light ratio corresponding to Kent's bulge mass becomes {M/L)r = 2.96 (line 
2 in Table B.1.1). This value is close to that [(M/L)« stellar = 2.67] one would obtain for a 12Gyr old, Z = 2Zq metallicity single stellar 
population (SSP; see Girai'di et al. (2002)) for a Zoccali et al. (2000) mass function (MF) (see §§ B.2.1 and B.3). 
We conclude that a normalization (equation (B4)) of 

po = 2.07 X IO'^Mq ai-csec"^ = 3.97 x 10'*MQpc-^ (B5) 

which reproduces Kent's bulge mass ofM = 4x 1O'°M0, is a reasonable assumption and represents an upper limit for the luminous 
matter in the bulge. 

B.1.2. DiskofMSl 
Like Kerins et al. (2001), we model the disk by a sech^ law, 

Pdisk(xo,yo,zo) = poexp ^ech^ (^^^ , (B6) 

1 /2 

with a(xo,3'o) = {xo+yo) being the radial distance in the disk plane inclined by 77°; the radial scale length = 28.57' and the 
vertical scale lengths h, = 1.34' are equivalent to Kerins et al. (2001) values h„ = 6.4kpc and h, = 0.3kpc for a M31 distance of 770 
kpc. Adopting a central brightness density of the disk in the r band Py™' = 27.39magarcsec"^ yields a surface brightness profile that 
matches the data of Kent (1989b) on the major axis and that agrees well with his central surface brightness of po = 20.4 mag in the r 
band. Spiral arms and dust explain the discrepancies at the minor axis (see Figure 23). We assumed 38° for the position angle of the 
disk. 

As for the bulge, we transform the luminosity density to matter density, using the disk color (Mr-Mx), disk extinction Ax, and disk 
mass-to-Ught ratio {M/L)x 




(B7) 
(B8) 



semi-major axis [arcmin] semi-minor axis [arcmin] 

Fig. 23. — Surface-brightness profile of M31 in the r band: red crosses are Kent's r band data for the central region of M31; green crosses are the Walterbos & 
Kennicutt (1987) data (their Table V) transformed to the r band. The left and right panels show the profiles along semi-major and semi-minor axis, respectively. Kent 
has decomposed the surface brightness profile into the bulge and disk component (red dots). For comparison we have superposed our bulge and disk surface brightness 
models from Eqs. B3 and B6. With the exception of spiral arm imprints, they match the observations extremely well. 
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TABLE 5 

Mass-to-Light Conversion Ratio for the M3 1 Disk Model 





Distance 
















(kpc) 


BandX 


(M,-Mx) 


Ax 




(M/Dx 


Comment 


16 X 10'" 


690 


gunn r 








1.4...1.7X lO'" 


11. 3. ..9.6 


for Kent's max. disk mass 


3.09 X 10'" 


690 


guiin r 








1.34 X 10'" 


2.31 


for Kerins's disk mass 


3.09 X lO'O 


770 


R 


0.54 


0.68 


3.5 X 10'° 


0.88 





Note. — This table shows in its last two lines the mass-to-light ratios resulting from the disk mass of Kerins et al. (2001), 3.09 X lO'^M©. The mass-to-light ratio 
for a realistic amount of extinction (last line), is close to a theoretical (M/L)^' stellar = 0.62 for a 2Gyr old, solar metalicity SSP disk population (based on Gould et al. 
(1997) and Girardi et al. (2002)). Comparing the first and second lines shows that the maximum disk — assumed by Kent (1989b) (first line) — would imply a much 
too large mass-to-light ratio, which is usually obtained for maximum disk models). 

to obtain the same disk mass as Kerins et al. (2001) Mdisk = // pdzda = Airpoh^hl. = 3.09 x 1O'"M0. Table B.1.2 demonstrates that 
this normalization results in a mass-to-light ratio that is expected for the disk population. 

With E{B-V) = 0.22 (Stephens et al. 2003) we obtain Ay = 3.1E{B-V) = 0.682 for the extinction in the M31 disk. This translates 
to Ar = 0.748 Av = 0.51 (Binney & Merrifield 1998). Adding the foreground extinction, Ar = 0.17 (Schlegel et al. 1998), we obtain 
Ar = 0.68 for the total extinction for sources residing in the disk of M31. Using that extinction, the M31 distance of 770 kpc and the 
central luminosity density of Pg™' = 26.86magarcsec"-^ [obtained from Pg™' and (Ai, - -Mr) = 0.53 for a disk color (B-V) « 0.7; 
Walterbos & Kennicutt (1987)], we get a disk luminosity of Ltot.s/io.R = 3.5 x 10'". For the disk mass of Kerins et al. (2001), our 
{M/L)r ratio becomes 0.88. This mass-to-light ratio is well consistent with a theoretical (M/L)^. stellar = 0.61 for a 2Gyr old, solar 
metalicity SSP disk population (based on Gould et al. (1997) and Girardi et al. (2002)). 

We also summarize the maximum disk model of Kent (1989b) in Table B.1.2 (first row). This model implies a 4 times higher (M/L)r- 
ratio, which is hard to reconcile with population synthesis models. 

Note that the results from Han & Gould (1996a) are not easy to compare with ours: they used a double exponential disk with 
po = O.35M0pc~"', h, = 0.4 kpc, and =6.4 kpc corresponding to a disk mass of 7.2 x lO'^M©. At the same time their bulge is also 
more massive than ours (4.9 x lO'^M©). 

B.1.3. HaloofMSl 

Our density models for the bulge and disk differ only slightly (e.g., in the central region) from that of Kerins et al. (2001). The 
contributions to the rotation velocity resulting from the different populations are therefore very much the same as in the Kerins et al. 
(2001) model. This implies that we can use the halo density distribution from Kerins et al. (2001) to obtain a halo model consistent 
with the observed M31 rotation curve. This halo density distribution is that of an isothermal sphere with a core radius of r^ = 2 kpc: 

Phaio(jco,3'o,zo) = 7^—0 1 r < 200kpc, (B9) 

with r = (xQ+jQ + z^y^^, r^ = 2kpc, and po = O.23M0pc"-'. Figure 24 shows the overall rotation curve of our model. 

In the model of Han & Gould (1996a) the core radius of the halo is much larger (r^ = 6.5kpc) to compensate for their higher disk and 

bulge mass in order to match the rotation curve of M3 1 . 

B. 1 .4. Halo of the Milky Way 
The halo of the Milky Way (MW) is also modeled as a cored isothermal sphere, 

PMw(£'oi)= r<200kpc, (BIO) 

l + ('-Ac) 

where we choose a core radius of r^ = 2 kpc as used in Han & Gould (1996a) and Gyuk & Crotts (2000). 




50 100 150 

r [arcmin] 

Fig. 24. — Overall rotation curve of our model (black curx'e) and its contributions of the bulge (red), disk (blue), and halo (green). These rotation curves match 
with Fig. 3/) in Kerins et al. (2001). In red crosses we show the data points derived from CO measurements of Loinard et al. (1995); in green, HI measurements from 
Brinks & Burton (1984); in blue, averaged data points from Widrow et al. (2003) (based on Kent (1989a), and Braun (1991)); in magenta, the data points of Kerins 
et al. (2001) (based on Kent (1989b)). 
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The central density is taken from (Han & Gould 1996a): 

po = O.OO79M0pc"^ ' 



l+(^ 



re 



= O.1343M0pc-^ (Bll) 



We convert the Galactocentric distance r to our line-of-sight coordinate system according to 



r(Doi)= ^r^-2r0Doicos(Ocos(^)+£)or, (B12) 

using the M31 Galactic coordinates / = 121.14988°, b = -21.61707° and the solar Galactocentric distance rQ = 8kpc (Bahcall et al. 
1983). 

B.2. The Mass Function 

B . 2 . 1 . The Mass Function for the bulge and disk sources 

For the M31 bulge we take the mass function (MF) ^ ^ M~^-^^ of Zoccali et al. (2000), which was derived for the Galactic bulge. The 
MF is cut off at O.OIMq at the lower end and at the MS turnoff I.OIMq at the upper end for a 12 Gyr old SSP withZ = 2Zq. 
We describe the disk with a Gould MF, ^ ~ M""^ 2\ which has a flattening ^ ~ M"°^^ below 0.59Mq (Gould et al. 1997). We cut the 
disk MF at 0.01 and 1.71M0 (2 Gyr old SSP with Z = Z0), respectively. Of course, the number of stars with a given mass changes 
for different cut off values or for alternative mass functions (e.g., Chabrier (2003)). The investigation of halo-lensing and self-lensing 
rates for different MFs is not a subject of that paper. 

B.2.2. The Mass Function for the Halo 

The mass function ^(M) for the potential MACHO population residing in the halo of M31 is of course unknown. In this paper we 
simply assume that the halo consists of one mass objects Mq only. 

Mo 

satisfying the normaUzation constraint 

'm^(M)JM=1. (B14) 



B.3. The Luminosity Function and CM Diagram 

We use a stellar LF obtained from isochrones of the Padova database of stellar evolutionary tracks and isochrones given by Girardi 
et al. (2002) (based on Marigo & Girardi (2001)). 

The luminosity function can be extracted from the mass function ^(M) discussed in § B.2.1. Using the mass-magnitude relation 
provided by theoretical steUar isochrones each mass bin [Mi,M,+i] of stars is cormected to a absolute brightness^"" bin [A^,, A^i+i]: 

M m , M,+i 

/ ^{M)dM= J S,{M)dM (B15) 

Mi Mi 

and therefore, 

M,+i(A1i+i) 

/ £,{M)dM 

$(X) « Mi<M< Mm ■ (B 16) 

Mm- Mi 

For the bulge we assumed a 12 Gyr old SSP with Z = 2Z0 (isoc_z040s.dat^^), which leads to good results for the stellar content of the 
bulge (C. Maraston 2004, private communication). 

For the disk we used for simphcity a 2 Gyr old SSP with Z = Zq (isoc_z019m.dat; see footnote 32) leading to acceptable results for 
the disk data shown in (WilHams (2002); fields INNER, NGC224-DISK, NGC224-POS2, G287, GU, G272, G87, K108, and G33). 
With the mass function ^(M) and the luminosity function <^{M) we obtain the mass-to-light ratio 

^max ^max 

/ M^iM)dM/MQ J M^{M)dM/MQ 

M\ Mn„„ _ Mn,.„ (BID 



L 



J FvegalO-o-4-^$(7W)d7W/J-0 < J" > / ^iM)dM/J'Q 



For a bulge MF as in § B.2.1 we get a characteristic flux < >= 0.20Tq,r, yielding a (M/L) in the R band of (M/L)k = 2.67 and 
a (B - y) = 1 . 14mag. For a disk MF as in § B.2. 1 we get a characteristic flux < !Fr >= 0.67 !Fq r, yielding a (M/L) in the R band of 
(M/L)r = 0.61 and a (B-y) = 0.88 mag. 

Note ttiat we neglect ttie correct indizes refering to the band X and define Fq =Fox, ^Vega =Fyega^<P = ^x,J^q =Tq^, Af = Af^,M = Mx, C =Cx-x', 
H = ixx, (M/L) = (M/L)x, A = Ax. 

3^ See Girardi et al. (2002) and http://pleiadi.pd.astro.it. 
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Fig. 25 . — Theoretical LF in the R band ^r{A4). Left: bulge for a 12 Gyr old SSP of 2 Zq metalicity. Right: disk for a 2 Gyr old SSP of 1 Zq metalicity. In red and 
green we show the values of the stellar radii obtained with § B.4 and the theoretical luminosities for the stars of the model SSP. The red line shows the average radius 
Rt according to equation (B19). In green we give the minimal and maximal radii of stars (reflecting the different values in color space) in the particular magnitude 
range. The LF was scaled by a factor of 1000 to show the two different histograms with the same scaling. The unit of the LF is number of stars per magnitude, the radii 
distribution is given in solar radii. 

Note that other values of Mmin and Mmax give different mass-to-Hght ratios, as the decrease of Mmin increases only the mass of the 
population, but not its luminosity. We show the LF for the bulge population in Figure 25, along with the stellar radii data (see 
§ B.4). Note that the faint cutoff of $(A^) affects the characteristic luminosity < T > but at the same time the normalization 
of = <i>(7W)/ / ^{M)dM. Therefore, the number of bright stars, Tiot < T \^^^^^^{M)dM, is neai-ly not affected by 

changing the faint cutoff. 

Using equation (43) we calculate the projected densities of bulge and disk stars brighter than M.r< Omag and show the results in 
Figure 26; basically at any position monitored by WeCAPP there is more than one bright star per square arcsec each from bulge and 
disk. This demonstrates that crowding in the central bulge is very severe even for the brightest stars with M.r< Omag and even if 
image PSFs are small. 

B.4. Radius-Brightness Relations for Stars 

For the inclusion of finite source effects one needs the radius-brightness relation of stars. The radius can easily be correlated to the 
brightness (and to the luminosity function) using log(L,) and log(reff_,) given in the theoretical stellar isochrones (see § B.3) 

2Q[logL,+logL,+]]/4 



R*iMi,Ci)-- 



V47r(TB 10'°g^=n '+i°s^f" 



Mi<M< Mi+i ,Ci<C< Cm . 



(B18) 



If we want to account for finite source effects without having any color information, e.g., equation (63), we use a color-averaged 
source radius 



RAM)= PcUM,C)RAM,C)dC, 



(B19) 



and replace R^(M,C) with R^{A4) in Eqs. 66 and 67 (see Figure 25). 
, logio N(>0 mag) [arcsec~^] = (-3, -2.5, ... ,2) 




logio N(>0 mag) [arcsec"^] = (-1.4, -1.2, ... ,1) 
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Fig. 26. — Number density of bulge stars (left) and disk stars (right) brighter than Mr < Omag in units of stars arcsec"^. The contours show the values of 
ifiN/{dxdy)\j^^^g = f ^ /q°° ^{A4R)ns{x,y,Dos)dDoidA4R and were obtained from the number density and luminosity functions of the bulge and disk component 

of M3I. The WeCAPP field, a square of 17.2', is shown as a box. The dashed contour outline a density of the Mr < stars of 10 stars/arcsec^ and demonstrate that 
one cannot resolve even giants in the central M31 field for the majority of ground-based data. The coordinates are that of the intrinsic M3I system (see Figure 4). 
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Fig. 27. — Geometry of the Galaxy-M31 system. A star at solar distance is assumed to move on a circular orbit with a rotational velocity of 220 km s ' (local standard 
of rest, LSR). M31 is located at Galactic coordinates Im31 = 121.2° and busi = -21.6°. The Sun has a velocity of 16.5km s~' relative to the LSR. The transversal 
velocity of M31 is shown as vm3i. 

B.5. The Velocity Distributions for the M31 Components 

The random velocity components of bulge, disk, and halo are assumed to be of Gaussian shape with dispersions taken from Kerins 
et al. (2001): 

c^buige = 100 km s"' , CTdisk = 30km s"' , ^^^q) 
Chalo = 166km S~\ (TMW-halo = 156km s~^ 

In addition, we account for rotation in bulge and disk of Vrot,buige = 30km s~^ and Vrot,disk = 235 km s~^ (Kerins etal. 2001). In a previous 
work Han &. Gould (1996a) used Uhaio = 170km s~^ for the halo, but a value of Ubuige = 156km s~^ for the bulge and disk (based on 
Lawrie(1983)). 

In the following two sections we derive the relative source-lens velocity vo taking into account rotation of the source and lens objects 
and the observers motion. The combination of all contributions results in one movement depending on 

vo (Dos , £>ol , Vrot ,1 , Vrot.s , V0-M3 1 ) ■ (B2 1 ) 

B.5. 1 . Additional Rotation for Lenses and Sources 

The additional rotation of the lens system Vrot.i (for bulge and disk lenses) and/or of the source system Vrot.s changes the relative velocity 
vq. For the calculation of the effect we first have to transform the positional components of a lens located at (x,y,z :=Ooi— <im3i) along 
the Une-of-sight to the components (xo,yo:Zo) in the M31 system. In the internal system the position is given by 

X() = x, >'o =>'cos/-zsin;, zo = ysinZ+zcos;, (B22) 

with inclination angle i = 77° and the distance to M31 d^^i = 770 kpc. Projecting on the base p = (xq+Jq)'^^, the rotation angle can 
be expressed as w = arccos(xo/p) = arcsin(yo/p)- 

Reprojecting the components of the rotation velocity Vx and v^^ (calculated for a clockwise rotation) 



yo 

V;c = Vrotsma;=- 



Vyz = \/v?ot-v2 = Z^" Vrot , (B23) 

to the y- and z- plane yields^^ 

Vy = Vy^ COS i, Vj = sln i, (B24) 

which depends on the position along the Une-of-sight ix,y,z). To combine this velocity vector iVx,Vy,v^) with all other velocities (see 
§ B.5.2) it has to be projected to the lens plane. 

B.5.2. Observer's Motion 

Finally, we have to account for the transversal velocity of M31 vm3i arising from the observers motion against M31. A hypothetical 
star on a circular orbit at solar distance (local standard of rest, LSR) has velocity vi(Rq) = 220 ± 15 km s \ The Sun is moving with 
V0 = 16.5 km s~' relative to the LSR toward the directions / = 53°, b = 25° (Binney & Tremaine 1987). For simpUcity we neglect the 
contributions to the Galactic height (see Figure 27) and calculate the transversal velocity of M3 1 as 

V0-M31 « (220km s"')sin(/M3i-90°) + 16.5km s"^ sin(121°-/LSR) = 129km s"\ (B25) 

with the Galactic coordinates of M3 1 /m3i = 121 .2° and busi = -21 .6°. The relative velocity between the velocity distribution of the 
lenses and the sources is calculated by projecting Vq_^^ to the lens plane 

For lenses residing in M31 this motion is negUgible compared to the rotation described in § B.5.1. 



v^-M3i«^^^^^129kms-i. (B26) 



' The relations are valid for the first quadrant, else the sign has to change. 
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